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ABSTRACT 

A one-day seminar was convened on November 4 , 1^69, 
to consider problems involved in communicating scientific information 
and the progress that has been made in improving the scientific 
publication mechanism. Thirty participants representing different 
types of information producers, users, and organizations contributed 
to the seminar discussion. Technologies that have facilitated 
machine-readable data bases were found to have materially facilitated 
the preparation of a variety of information products that are needed 
for both large collection and special interest oroun purposes. 
Additional research and lover costs are considered necessary to 
permit the extensive exploitation of time-shared, computer-based 
networks, but current hardware capabilities indicate that direct us<>r 
involvement at individually accessible consoles is possible and is 
expected to materially aid the development of well-organized, 
user-oriented information bases. (Author) 
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STUDY OBJECTIVE 



Questions of how to Improve the handling of the burgeoning 
literature of science and technology, and hovf to ensure the speedy 
communication of information about research and development. (R & D) 
results to those that need them, are receiving Increasing attention 
In many quarters. These questions have Immediate Importance to those 
who use, or should be taking advantage of, current Information in their 
work. They are also important to R & 0 planners, managers, and budget 
officers whose projections are Indirectly affected by the availability 
of information and the costs in time and manpower, as well as dollars, 
that must be incurred In Its acquisition. Publishers and Information 
service suppliers have been seeking and developing new approaches to 
packaging and disseminating information not only to gain new markets 
but also to alleviate mounting production costs. 

Moreover, how Information Is organized, packaged, and used 
has implications both to education and to the exploitation of new media 
and technologies for educational purposes. Collecting and organizing 
Information so that It can be conveniently and effectively used by 
people Is a goal of educators as well as Information processors. The 
proper use of Information reflects education on the part of the user. 
Thus, Information products should, optimally, be designed to educate 
as well as Inform. 

To consider ways of attaining these goals, a one-day seminar 
was convened that was hosted by the Office of Science and Technology, 
Executive Office of the President, and sponsored by the Library and 
Information Science Research Program, 8ureau of Libraries and Educa- 
tional Technology, U. S. Office of Education. Participants were 
Invited who represented a cross-section of organizations concerned 
with matters of scientific publication, communication, and dlsseml-. 
nation. These organizations Included government sponsors of Information 
handling activities, not-for-profit producers of information products, 
academlcvans concerned with uses of Information In connection with 
economic and Industrial development, Information system designers, and 
scientist producers and users of Information. 
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SCOPE OF THE SEMINAR 



Thirty persons met on November 4, 1969, in the National 
Science Foundation Board Room to consider questions concerned with 
scientific publication. A list of the participants is given In 
Appendix I. Discussion proceeded accord'ng to the following agenda: 

Session A. At this session, participants introduced the 
activities with which they are engaged. These 
activities are summarized In brief descriptions 
that are Incorporated In Appendix II. 

Session B. This session considered the relevance of 

current work to conjectures of future needs 
at national and International levels. Par- 
ticular emphasis was given to Influences of 
current progress In science, technology, and 
education to likely future developments. 

Session C. This session considered current activities 
critically In relation to perceived gaps In 
technologies, and possible effects of tech- 
nological developments on future Information 
actl vl ties . 



Session D. This session elicited suggestions concerning 
the roles and missions that various organi- 
zations have and should have to foster the 
handling and communication of information. 
Public and private organizations, for-profit 
and not-for-profit, were Involved In the 
participants' considerations. 
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ISSUES REGARDING SCIENTIFIC PUBLICATION 



Lost — One-Third of the World's Scientific Literature is the 

title of a National Science Foundation brochure prepared in 1960.1/ 

This title dramatically encapsulates the dilemma resulting from the 
growing volume of scientific activity and technological development 
that (jave been fostered worldwide* particularly since the close of 
World War II. This booklet was primarily addressed to the language 
barrier that was estimated to cut about 95 percent of U. S. scientists 
off from nearly 30 percent of the world's literature. Less than 
2 percent of scientists had reported to the National Science Foundation 
that they could read Russian, and reading capability by more than 
1 percent of the scientific population was reported for only four 
other languages (German, French, Spanish, and Italian). 

In addition to promoting the translation of foreign-language 
materials, this brochure recommended the voluntary contribution by sci- 
entific societies, scientific institutions, and Government agencies 
that conduct translation programs of translations made by them to 
national collection centers for centralized announcement and redis- 
tribution. The brochure also advocates the preparation of critical 
reviews and state-of-the-art papers for communicating information 
about foreign science. 

The search for alternatives to the scientific periodical as a 
means of communicating information has been discussed in the literature 
for about forty years.f/ The practice of announcing the availability of 
papers through abstracts or notes in periodicals was introduced over a 
century ago in the Royal Society Proceedings. A useful analysis of the 
pros and cons of the scientific periodical was made in 1960 by Ralph 
Phelps and John Herling of the Engineering Societies Library. Among 
deficiencies, they include: 

a. delays of a year or more between the time of acceptance 
of a paper and its appearance in print, 

b. restrictions on the length of papers and other editorial 
demands that exclude historical and theoretical material 
and, sometimes, references, 

c. a scattering of papers across a multiplicity of journals 
that hampers scientists from keeping informed and libraries 
from comprehensive coverage of a field, 



1/ Lost— One-Third of the World's Scientific Literature. Washington, 
u. C.» National Science Foundation, 1960. Report NSF 60-20. 

2/ Ralph R. Phelps and John P. Herling, "Alternatives to the Scientific 
Periodical." Unesco Bulletin for Libraries , Vol. 14, No. 2, 1960, 
pp. 61-75. 
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d. poor matches between journal coverage and scientists' 
specializations that limit their interest to only a few 
papers in any periodical, 

e. the high cost of journals to both individuals and li- 
braries that limit acquisition, and, thereby, coverage, 
even by abstract services, and 

f. wasteful incursions that the system imposes on the time 
of both the scientist-searcher for information and the 
scientist-reviewer and editor (the latter often on a 
volunteer basis that distracts from work in which he 
should be engaged). 

Proposals for overcoming these deficiencies have included the following 
suggestions: 

a. substitution of the individual paper for the periodical 
as the primary unit of distribution (the most often-made 
suggestion), 

b. establishment of Information centers on national and 
international levels for centralized publishing, bibli- 
ography, reference, copy distribution services, 
announcement and abstract periodicals, etc., 

c. selective dissemination of papers automatically to li- 
braries and individuals i » accordance with registration- 
of-interest lists (as well as on an as-reouested basis), 

d. establishment of an efficient method of weeding out poorly 
written and unoriginal papers before they are published, 
possibly by a central editorial bureau of experts, 

e. coupon systems for ordering papers from abstract journals, 

f. production of abstract journals that describe papers in 
all languages in a common language, 

g. Issuance of tables of contents of periodicals from which 
desired papers could be ordered (proposed by J. D. Bernal 
In the late 1940's and first attempted by E. Garfield in 
1953), and 

h. simultaneous preparation by authors of a full report, a 
two-page abridgement, and an abstract, selected papers and 
abstracts being published in journals with all abstracts 
sent to Journal subscribers (possibly on flleable cards) 
who could then order unpublished papers of their choice. 



Phelps and Herling contend that arguments against the periodical do 
not overcome the following criticisms of alternative proposals: 

a. the tendency of more limited distribution to impose 
restrictions on scientists' reading, thereby reducing 
the value browsing and casual reading are believed to 
have on creative thinking and education, 

b. the threat of censorship inherent in centralized 
regulation of publication and dissemination, 

c. the administrative burden on distributing agencies and 
on documentation centers and libraries attendant on 
"separates" schemes, 

d. higher processing costs that could no longer be at 
least partially defrayed by the financial support of 
advertisers, 

e. loss of the skills of scientific editors who would not 
be expected to serve centralized agencies, 

f. loss of intrinsic merits of journal publication and the 
associated confidence readerships have in those that 
have gained reputations for merit, particular scientific 
Interests, etc., 

g. damage to the coherence and vitality of many professional 
societies whose raison d'Stre, at least in part, is their 
periodicals, and 

h. the minor role that publication is presumed to have in 
scientific communication, much of this having been shown 
to be effected through interpersonal exchange, Informal 
correspondence, attendance at meetings, etc. 

Although problems of scientific publication and communication 
of Information are often consider J from either the publisher's or the 
user's point of view, too often the role of the originator or producer 
of information is overlooked. Another N$F brochure contains the fol- 
lowing admonition: 

"The scientific information problem begins in the research 
and development establishments where project results are 
converted into documentary form. These information sources 
include Government agencies and their contractors, private 
industry, colleges and universities, and independent research 
institutions. Not all users are sources, but every source is, 
of necessity, a user of scientific information. "3/ 



3/ Seme Problems in the Reporting of Research Results. Washington, 0. C. , 
National Science Foundation, circa 1960. 



Some of the activities suggested to these source-users to promote timely 
and meaningful communication Include: 

a. prompt dissemination of information on current research 
in progress, 

b. better program planning and selection of papers presented 
at symposia and conferences, 

c. exploitation of new media and formats to speed the 
information flow and simplify bibliographic, storage, 
and retrieval phases, 

d. the development of new techniques for separating and 
consolidating information to facilitate retrieval and 
use of information, 

e. reporting of negative results, 

f. use of and improvement in the quality of abstracts 
accompanying papers, 

g. source indexing to guide and assist indexers of 
periodical and documentation services, 

h. the development of uniform terminology and coding 
systems to facilitate bibliographic processing, 
storage, and retrieval, 

1. use of conventional publications that provide the best 
means of making Information readily available, 

j. prompt announcement of Industrial reports and university 
theses with provisions for making them available, and 

k. reassessment of policies and criteria governing the 
application and removal of security and other limitations 
on access to the literature to facilitate information 
flow. 

On the international scene, several organizations have 
focused on problems of scientific publication and Information flow. 

These Include the International Federation for Documentation (FID) , 
the International Federation of Library Associations (IFLA), the 
International Council of Scientific Unions (ICSU) Abstracting Board 
(ICSU/AB) and Committee on Data for Science and Technology (CODATA), 
the UNESCO International Advisory Ccxr.mlttee on Documentation, 

Libraries, and Archives, the Technical Information Panel of AGARD 
(NATO's Advisory Group for Aerospace Research and Development), and 
UNISIST (a body created by ICSU and UNESCO to examine the feasibility 
of a world science information system). 
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UNESCO Working Parties have, for example, been considering 
such topics as: 

a. relationships between abstracting, Indexing, and primary 
publications, Including the relative value of Indexes and 
abstracts for users , 

b. the role of scientific periodicals and alternatives as 
ways of publishing scientific papers, 

c. the organization and functioning of abstracting services 
In various branches of science and technology, 

d. a code of good practice for scientific publications, 

e. standardization for primary and secondary publications of 
such elements as abbreviations of titles of periodicals, 
transcription of cyrillic characters, and bibliographic 
cl tations, 

f. the creation of regional, subject, and national associa- 
tions of editors of scientific and technical periodicals, 
and 

g. Improvements In the preparation and availability of 
publications resulting from scientific meetings. 

The contents of Appendix II reflect various approaches that 
have been and ai% being attempted that emanate, at least In part, from 
the deliberations reviewed above. Participants at the Seminar are 
finding that machine-readable data bases are facilitating the pro- 
duction of a variety of Information products for both archival and 
large central-store purposes and for the needs of special Interest 
groups. Analytic products In the nature of evaluative reports and 
states-of-the-art that are being developed by commercial organizations 
are proving salable with the proviso that a good marketing mechanism Is 
a necessity for informing potential users of the products. Emphasis 
was suggested on user Involvement to ensure the Intelligent use of 
Information products and Information. Technologies are making It 
possible to give users direct access to large central files and to 
enable them to restructure files In accordance with their particular 
needs from consoles that can become part of their personal office 
equipment. Machine-based networks are expected to have the effect of 
bringing members of conmon- Interest communities who are geographically 
dispersed In closer contact through the medium of their consoles. 

Hie content Improvement and enhancement are anticipated as users gain 
the ability to add comments and corrections to stored material. 
Mechanization Is also expected to enable quantitative assessment of 
use that Is made of Information and up-grading of Information stores 
through monitoring programs that need not Interfere with monitored 
users. To date, most of the Information products that have been 
developed have been Innovations of producers. The greater user 
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Involvement that Is anticipated is expected to stimulate Innovations from 
users. Increased Investigation of television, audio, and audio-visual 
technologies was suggested for the kinds of information they are well 
suited to transmit. Different types of information packages with the 
same Information content were not viewed as detrimentally duplicative 
when each served a useful and needed purpose. Consideration of 
possibilities of cost sharing by various organizations was recommended 
to spread the burden of admittedly high costs of establishing and 
maintaining needed information bases. 
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SUMMARY 



A one-day seminar was convened on November 4, 1969, to con- 
sider problems Involved In communicating scientific Information and the 
progress that has been made In Improving the scientific publication 
mechanism. Thirty participants representing different types of Infor- 
mation producers, users, and organizations contributed to the seminar 
discussion. Technologies that have facilitated machine-readable data 
bases were found to have materially facilitated the preparation of a 
variety of Information products that are needed for both large-collec- 
tion and speclal-lnterest-group purposes. Additional research and lower 
costs are considered necessary to permit the extensive exploitation of 
time-shared, computer-based networks, but current hardware capabilities 
indicate that direct user involvement at Individually accessible con- 
soles Is possible and Is expected to materially aid the development of 
well-organized, user-oriented Information bases. 
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REPACKAGING: RETROSPECT AND PROSPECT 



A memorandum Is reproduced below that has had an interesting 
history. Originally prepared by the author for Dr. F. B. Rogers to dis- 
cuss at the Gordon Conference In the summer of 1961, It was shared with 
Dr. Alvin M. Weinberg on the occasion of the NFSAIS meeting In 1962, and 
subsequently with the Heller Associates as they were preparing the "Or- 
ganization X" report It has not been revised or updated In the eight 
intervening years. It was the philosophic base of the exploitation of 
the MEDLARS data base In cooperation with need groups. MEDLARS now pro- 
duces for distribution by others 15 specialized packages from a common 
data base. 



Viewed in retrospect, sponsors of mission-oriented abstracting 
and Indexing publications have not so far broken down the doors of the 
custodians of the discipline-oriented data bases. The services them- 
selves have experimented with repackaging their own products. The work 
on compatibility and convertibility of outputs which would enable others 
to do this did not get done. 

Viewed in prospect, it is the group of secondary processors of 
tapes for SDI purposes who are accomplishing the function of melding the 
products of the large discipline-based systems. This group has made 
rapid progress in recent years. The group is international, and its 
work will have considerable impact on the future provision of informa- 
tion services. Whether it will concern itself with the generation of 
published products remains to be seen. It appears more likely that the 
information industry will fill this need. 



DISCIPLINE AND MISSION ORIENTED SCIENCE 
IN RELATION TO THE ABSTRACTING PROBLEM 



"Indexing and abstracting services traditionally 
reflect the organization of the sciences which have created, 
supported, and used them. Thus Chemical Abstracts has re- 
flected scientific assumptions 1) that there is a disciplin- 
ary area ol science and technology with a common concern for 
the constitution and reconstitution of matter and 2) that 
the practicioners, teachers, and researchers in this general 
discipline divide the field by agreed principles and organize 
laboratories, academic courses, and professional associations 
accordingly. History records that these considerations have 
conditioned the traditional organization of abstracting 
services — by scientific discipline . 
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Science Is in the middle of a revolution today. 

While organization by discipline has survival value for edu- 
cation, the frontiers of research are characterized by cate- 
gorical or mission orientation, where the knowledge and skills 
of multiple disciplines are brought to a common focus on a 
problem area. Conflicts between discipline- and mission- 
oriented research constitute a major issue in the planning and 
administration of research. Similarly, this conflict has a 
profound impact on the organization of information systems, 
the implications of which, possibly more far reaching than the 
sheer volume of scientific publication, or increased speciali- 
zation, we are just beginning to understand. 

Multidisciplinary resea/ch obtains in both basic and 
applied fields. It is characterized by ad hoc cooperation 
among the traditional disciplines toward solutions of problems 
considered socially important. This cooperation may or may not 
be enduring. Philosophically considered, multidisciplinary re- 
search may demonstrate the seamless fabric of sciences; in 
practice, it is concerned with borrowing threads from many areas 
of the fabric to reweave them into a new design. 

The information requirements of multidisciplinary 
research are critical. Once a categorical problem has been 
identified and researchers from multiple disciplines assembled, 
two basic needs are expressed: 

1. To pool the knowledge of the contributing 
disciplines which relates to the problem, as a fund of common 
knowledge is essential for a common understanding. 

2. To provide a current awareness service — 
abstracts, indexes, newsbulletins, etc. — reflecting current 
contributions from all the scientific disciplines which bear 
on the problem. 

The effect of this trend is marked. First, the re- 
quisite retrospective knowledge derives from tearing down dis- 
ciplinary organized information to reconstitute it with the 
new mission orientation. Second, the current services either 
require current dependence on existing discipline-oriented 
services, or must duplicate their activities. 

The problem becomes acute when the economic dimen- 
sion is added. Mission-oriented research is by definition 
of social importance, and on the frontiers of sciences. It 
is relatively easy to find generous financial support from 
governmental or other sources. Such support, however, t urds 
to decrease support available for the traditional discipline- 
oriented services, which have acknowledged survival values. 

If this trend continues, the following consequences 
are inevitable: 



O 

ERIC 



- 22 - 



1. The discipline-oriented services will be 
placed in a critical position. 

2. The problem-oriented services will pro- 
liferate, and uncoordinate duplication of effort is inevit- 
able. 



3. The scientific disciplines dependent on 
traditional abstracts and indexes for a permanent scienti- 
fic record, or for educational purposes, will be the losers. 

A radical rethinking of this problem is absolutely 
necessary. Supporting agencies have made tentative moves 
already in response to economic pressures, but long range 
planning is absent. This planning should be in the direc- 
tion of coordination and economic inter-dependence of dis- 
ciplinary and problem-oriented systems. 

The fundamental requirements for a combination 
system which would 1) maintain discipline-oriented systems 
for education and record purposes, and 2) permit the econ- 
omic establishment of multiple mission-oriented systems as 
required, are: 



1. Coverage of the scientific literature 
must be comprehensive. 

2. This coverage must be both speedy and 

economical. 

3. There must be maximum fluidity of the 
end product of such coverage, to enable it to be unitized 
and reassembled according to any mission-oriented need. 

Computer technology! as it is applied to storage 
and retrieval of published information, ptovldes a power- 
ful instrument to bring this relationship about. The key 
requirement is that of unitizing information produced in 
disciplinary packages, manipulating it and producing it so 
that it may be applied for mission-oriented purposes — an 
operation computers are uniquely able to perform. 

The following is a theoretical outline of a 
suggested solution: 



1. Achieve an arbitrary division of the 
disciplines by supra-disclplinary groups (e.g. chemical 
sciences, physical sciences, earth sciences, biological 
sciences). Build up the existing servicea ( Chemical 
Ab&tract& % Biological Abstracts $ etc.), and create, under 
appropriate sponsorship, new ones to guarantee comprehen- 
sive coverage of world scientific publication. 
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2. Provide each such supra-disciplinary ser- 
vice with a computer facility* Work toward maximum compati- 
bility of thesauri, and convertibility of output products. 

3. Reverse the usual abstract/index relation- 
ship. Have these services prepare computer composed citation 
lists first , and superimpose abstracting operations as a sec- 
ondary activity. 



4. Encourage the mission-oriented services to 
rent screening services and listings from one or more of the 
supra-disciplinary services. These f ee-for-service activities 
could, at the customer's option, also include: 

a* full abstracts 

b. photoduplication of all listed papers 
for later abstracting 

c. contract preparation of complete mis- 
sion-oriented abstract service. 



Role of Government 



While many exceptions may be found, the research 
programs of Federal agencies are predominantly mission- 
oriented. The Smithsonian Institution, which has survived 
as a home for much descriptive science, and the NSF are two 
exceptions to a list of agencies (DOD, NSF, HEW, NASA, AEC, 
etc.) which, generally speaking, all have categorical missions. 

This is not to say that these agencies do not con* 
sciously support research by discipline, or that they are not 
involved in scientific and administrative problems Incurred by 
trying to exist simultaneously in two worlds. It is to say 
that the resolution of their organisational and support pro- 
blems are dictated by overriding forces representing subordin- 
ation to goals which society considers important. 

It follows that government sponsored adstractlng 
services reflect the missions of the agencies sponsoring them, 
and it follows further that the sum-total of govemmentsi 
scientific missions mty always be less than the totality of 
science. 



Therefore, if the proposed system is to be imple- 
mented through Federal planning and funds, it would appear 
necessary to seek authority at the level of the Office of 
Science and Technology. 
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Discussion 



This proposal offers a possibility of integra- 
ting discipline- and mission-oriented information systems 
in such a manner as to: 

1. Encourage the survival of the tradi- 
tional services, by basing pay-as-you-go problem-oriented 
systems on their productivity. 

2. Offer wide choice in the degree of 
dependence or integration, ranging from a citation screen- 
ing service, to the supply of completed abstracts. 

3. Effect marked economies in those opera- 
tions which can best be done centrally — the collection and 
primary screening of the scientific literature. Present 
duplication of these necessary preliminary steps now con- 
stitutes a heavy expense to all concerned. 

4. Satisfy both groups with promptly pro- 
duced indexes or announcement services. 

5. Satisfy both groups with comprehensive 
abstracting coverage. 

6. Provide a basis for a wide variety of 

other forms of secondary publication: review papers, 

compendia, handbooks, bibliographies, etc. 

This is a theoretical discussion of a complex 
practical problem; there are many difficulties to be 
overcome in further planning and in implementation. It 
represents, however, the philosophical underpinning of 
the MEDLARS project currently undertaken at the NLM. 
Graced success, it may be possible to test some of the 
theory in the not too distant future." 
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BURTON W. ADKINSON 



National Science Foundation 
Office of Science Information Service 
Washington, D. C. 



OSIS CONTRIBUTIONS TO PROGRESS 
IN SCIENTIFIC PUBLICATION 



The objective of the Office of Science Information Service 
(OSIS) Is to ensure that U. S. scientists and engineers have adequate 
Information services to satisfy their needs. One approach in securing 
this objective has been the promotion of new and better techniques and 
systems for handling and disseminating science Information; another Is 
the improvement of existing systems and techniques to make them more 
effective. These approaches entail a wide variety of activities. For 
example, the OSIS encourages the major professional societies In the 
various scientific and engineering disciplines to Integrate the Infor- 
mation functions and services within their respective fields and to 
cooperate with their foreign counterparts and appropriate International 
organizations. At the same time, existing Information services such as 
abstracting, Indexing, and publishing are provided with partial support 
on a temporary basis. Universities and academic research organizations 
are encouraged to develop systems to meet the Information requirements 
of scientists and engineers In their localities. Other projects we 
have sponsored include studies of communication patterns, the transla- 
tion of current foreign-language journals, conferences, and travel to 
International meetings. 

The OSIS does not Itself operate any Information service, 
reference service, or translation service. Its functions are limited 
to support of the activities of other organizations. In exercising 
this function, the OSIS Is responsible for the prudent utilization of 
Federal (l.e., the taxpayers') funds. Through appropriate review 
mechanisms, the OSIS ensures that the activities supported by the funds 
appropriated for Its programs contribute to the purposes and goals which 
describe the mission of the Office. 

Problems with respect to scientific publication, or, more 
broadly, problems with respect to the communication of scientific and 
technical Information, result to a considerable degree from a prolifer- 
ation to the point that the scientific and technical conmunlty Is now 
burdened with the wealth of published resources and seeks means to cope 
with the flood of new Information embedded In the published record. The 
response to this set of problems by the scientific and technical com- 
munity has resulted In Innovative efforts and experiments of which a 
significant and seminal number have been supported by the OSIS. 
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Program Organization 



At the present time, the organization of the OSIS reflects a 
program emphasis that ranges from research through development to oper- 
ational support. Basic research and exploratory development are sup- 
ported in our Research and Studies Program; the development of Informa- 
tion systems is supported by the Information Systems Program; and oper- 
ational support Is provided by the Information Services Program. A 
separate unit was organized to administer support for the development 
of a national chemical Information system, and a Special Projects Pro- 
gram has been given the responsibility of funding projects designed to 
improve university-centered information systems. This pattern of 
organization does not explicitly recognize the publication function as 
a discrete area for support. Other programs as well as those mentioned 
above find themselves concerned with publication support depending on 
whether the requirement is for deficit funding of an ongoing operation, 
for research In the ways publications are exploited or ways In which 
they might be Improved, for developmental efforts when the requirements 
are known, and Innumerable other conditions. 



Patterns of Support for Innovative Scientific Publication 



The patterns vary with the nature of the problem that Is ad- 
dressed. The problem set is conditioned by factors that are operative 
In the Information- transfer environment. A simplistic enumeration might 
characterize these problems as follows: 

1. Problems arising from the fact of the Increasing 
amount of published materials competing for attention of the scientist; 

2. Problems arising from economic relationships; 

3. Problems arising from technological change; 

4. Problems arising from changes In the total Informa- 
tion transfer system. 

This characterization tends to separate essentially Inter- 
related problem situations, but appears to be useful for descriptive 
purposes. Thus, the problem of developing an Innovative approach to the 
function of providing primary journal literature to scientists has been 
recognized In a number of differing contexts. One such approach assumes 
the availability of primary literature and explores the possibilities of 
Improving the secondary Information services designed to assist In In- 
forming about the availability, In organizing Information about the 
availability, and In assisting the selection and retrieval process. 

This approach entails recognition that the secondary services are them- 
selves Issued In published form and that they require Improvement in 
terms of comprehensiveness, speed, and reliability. Innovation In this 
area has Involved exploitation of computer technology to expand the data 
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base serving secondary services and to provide this data base In 
machine- readable form; It has Involved the development of specialized 
and selectively oriented Information services on the foundations of 
expanded and Improved data bases; It has Involved development of new 
products such as permuted title Indexes, citation Indexes, and selec- 
tive dissemination services, which have become conventional In a sur- 
prisingly short time; and has led to developments which anticipate In 
some Instances the displacement of traditional Ink-print publication by 
on-line communication which nevertheless embodies basic publishing 
functions. 



Examples of Innovative Efforts 



The general problem emerging from the fact of Increased publi- 
cation Involves the fact that the traditional journal seems to have 
become unwieldy as a mechanism for providing rapid and specific Informa- 
tion needed by the Individual scientist. A number of studies Indicate 
problems with respect to the low percentage of readers of the Individual 
articles, the Increasing lag between completion of research and publica- 
tion of findings, the economic burden on the Individual subscriber In 
order to maintain an Increasingly expensive dissemination mechanism, and 
other problems. 

The response to these problems has emerged In a number of 
forms. The OSIS had received suggestions that journals might be pack- 
aged to the needs of the Individual scientist. Cormunicatiom in Be- 
havioral Biology Is an experimental project In which articles would be 
pre- Indexed, reviewed by a staff of editors within two weeks of receipt, 
and processed for later computer printout or offset publication. In- 
stead of appearing In a bound-journal format, the Individual articles 
would be forwarded to subscribers on a flexible demand basis. The 
demand function Is geared to the publication of abstracts of the In- 
dicated articles together with an Index. The subscriber orders the 
desired articles, or Indeed all articles, by virtue of the published 
abstracts and Index. Since the initial formulation, the editors have 
arranged to secure abstracts of articles published In other journals, 
Include these In the publication, and provide hard copies on demand. 

The experiment Involves collaboration between the scientist-protagonist 
(a university professor), who carries editorial responsibility, and a 
commercial press. Both the university and the press contribute to the 
editorial expenses and publishing costs. The OSIS Is a major contri- 
butor to the experiment, but foresees that successful experience will 
lead to a financially self-sufficient operation. 

The American Mathematical Society has Initiated a similar ex- 
periment with OSIS support. The Mathematical Offprint Service provides 
articles from a growing number of mathematical journals to Individual 
scientists based on a combined notification and response function keyed 
to the Interest profiles maintained and updated In the society's compu- 
ter system. Interestingly enough, the Index vocabulary which Is basic 
to profile organization and management Is presently being Integrated 
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with the Indexing of the society's journals and Is also a candidate 
thesaurus for wider adoption In mathematics publishing circles. 

Economic problems beset the entire publishing industry. 
Scientific publication is particularly sensitive to economic pressures 
because of its essential nonprofit nature by virtue of the fact that 
scientific publishing Is mainly carried on by professional societies and 
academic presses. The policy promulgated by the Federal Council for 
Science and Technology has guided the funding practices of the OSIS. 
Support has been provided to Improve publishing practices, to pass 
through deficit periods, to Improve and increase circulation, to exploit 
advanced technology, and Improve overall management and editorial con- 
trols. Support for these purposes provided to ongoing operations is 
time limited. A condition of support is the availability of a plan that 
Indicates how the protagonist, the editor or publisher, Intends to man- 
age without continuing subsidy and attain essential self-sufficiency 
within a three-year period. 

The most significant technological advance affecting scienti- 
fic publishing Is the computer. Experimentation supported by the OSIS 
demonstrated the applicability of computers to Journal production. The 
single most vexing problem In computer-controlled composition as applied 
to scientific publication relates to the capability of producing mathe- 
matical and scientific formulae via clerical keyboarding. Although this 
problem has not yet been solved In a manner as to produce tho desired 
cost savings, developmental progress has been recorded In tying Journal 
production via computerized composition to secondary bibliographic or- 
ganization and dissemination. The American Chemical Society and the 
American Institute of Physics are Implementing long range plans to har- 
monize the automated production of Journals with the operation and dis- 
semination of secondary abstracting and Indexing services In both tra- 
ditional printed and nontradltlonal tape formats. 

The Implications of the computer technology as applied to 
publication and dissemination of scientific Information have not been 
and perhaps cannot be tested for some time to come. The scientific 
and technological community, especially that segment concerned with 
producing and publishing Information as well as operating Information 
services, Including all sectors of our society, is faced with problems 
such as property rights 1 n data, In copyright, and In software Invest- 
ments. These problems have barely surfaced and are not yet understood 
In their full complexity and dimension so as to permit an attack and 
effective solution efforts. The commission proposed by recently intro- 
duced legislation to study the effects of the computer technology on 
traditional copyright relationships; the efforts to study the effect of 
the computer on the rights of privacy of the Individual; the economic 
consequences of develoMng communication networks which may make tradi- 
tional publishing functions obsolete and dispose of traditional commun- 
ication mechanisms such as the journal, the conference, and similar 
accepted communication media, are all signs of the current ferment to 
which the OSIS Is sensitive. The funding of studies In these areas, and 
the cooperation with efforts to rationalize the steps required by these 
developments, Is presently engaging the attention of the staff of the 
Office. 
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At the same time, conscious of Its responsibilities to ensure 
the availability of scientific and technical Information as assigned to 
it by the Congress, the OSIS continues to support the publications 
deemed necessary by the scientific and technical community consonant 
with Its resources and program priorities. 



LAURENCE H. BERUL 



Auerbach Corporation 
Philadelphia , Pa. 



ACTIVITIES IN COMMUNICATING TECHNICAL INFORMATION 

I summarize here experiences we have accumulated as a producer 
and user of scientific publications. 



Data Base Construction 



We recently reported on an analytic study performed for the 
National Agricultural Library (NAL) that investigated the means by 
which NAL might interface Its own automated programs with relevant 
bibliographic data extracted from other mach1ne--readable data 
banks. 0) We surveyed nine data bases that represent almost seven 
million entries of potential value to NAL from 1959 to the present. We 
found that record formats vary significantly, variable-length fields 
being used most frequently to accommodate the variable formats of bib- 
liographic data. Standardization of record formats Is generally being 
regarded with a wait-and-eee attitude; the Library of Congress' MARC II 
format has created the greatest amount of Interest. With respect to 
data element representation, the CODEN for serial titles, used by four 
of the nine services, Is the most widely accepted standard. 

Generally, services In the profit-seeking sector appear to be 
evolving toward a service-center concept where the service will not only 
supply the data base but will also offer Information services such as 
SOI and demand searching to the user. Those services In the nonprofit 
sector appear to be developing an opposite approach; they supply only 
a data base that the subscriber utilizes by whatever means are available 
to him. Packaged computer programs have not been made generally avail- 
able, and users have had to develop their own search and processing 
software; three services do offer some programs to their users. 

Though progress seems likely In the area of format conversion 
and restructuring, the problem of Incompatibility among subject vocab- 
ularies still remains basically unsolved. One mode of data base inter- 
action recommended to NAL was conversion of extracted portions of other 
dat^ bases not only to NAL's own file structure, computer medium, and 
format, but also to Its own Indexing terminology, thereby permitting the 
merging of such data Into NAL's own master bibliographic data base for 
searching and/or announcement. A study of the relative convertibility/ 
compatibility of the vocabularies disclosed a total conversion rate for 
all vocabularies from 32.7 percent to 89 percent, based on a sample se- 
lection. The rates Include an Intellectual step of human conversion of 
terns that did not convert through a direct match or via associated 
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terms. The vocabularies that most closely resembled the thesaurus for- 
mat converted more successfully than uncontrolled vocabularies. 



Auerbach Publications 



Auerbach publications seek to provide information resources 
for the computer industry similar to those established by the West Pub- 
lishing Co. for attorneys and by the Commerce Clearing House for ac- 
countants and attorneys. These publications include: Auerbach Stan- 
dard EDP Reports, Auerbach Data Communication Reports, Auerbach Data 
Handling Reports. 

These loose-leaf services are believed to go much further 
toward solving the Information problem than abstracting and Indexing 
services or most mechanized information retrieval systems. The key 
to their usefulness Is the depth of Information analysis and synthesis 
that is involved in their production. It should be noted that these 
services are offered In a field In which there is no paucity of printed 
matter. However, much of It Is trivial, too poorly written, and lack- 
ing In a common organizational structure. 

The Auerbach publications Incorporate five basic features that 
contribute to their value to their user. These are the user's guide, 
the concept of performance measurement, the use of comparison charts, 
state-of-the-art reviews, and monthly or quarterly updating. The user's 
guide Is a comprehensive outline that describes the format of the ser- 
vice. A faceted decimal numbering system Is employed for rapid access 
and reference. 

To facilitate user evaluation, a method of system performance 
measurement was devised that Involves the use of standard benchmark 
problems. Each standard problem Is fully detailed. For each computer 
configuration, for example, program timings are developed by an Auerbach 
analyst that ar« then Incorporated In a chart for the service. Users 
need not have the same problem to exploit the benchmark evaluations. 



Information Marketing 



Perhaps the most Important element for the survival of a high- 
cost Information service Is Its approach to communicating Information 
about Itself that generates sales, l.e., marketing. 

We have found that quality of a service does not attract cus- 
tomers of Itself. It Is necessary to actively promote the sale of In- 
formation services. We have also found that services can be developed 
and maintained without government subsidies at prices that enable the 
financing of a reasonable In-house program of new product development. 
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We have, until now, relied primarily on the printed page for 
Information transfer, and the loose-leaf method of updating. We are 
exploring products on microfilm and data bases In the form of magnetic 
tapes, disks, or responses via a time-sharing terminal. We are also 
considering other services, for example, a software evaluation report- 
ing service, packaged training courses for EOP systems, a technology 
evaluation service, and new methods of packaging specialized libraries 
for the small and medium-size user. 
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University of Pittsburgh 

Council of Social Science Data Archives 
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ROLE OF COMPUTERIZED ARCHIVES FOR THE 
SOCIAL SCIENCES AND HUMANITIES 



Social science data banks or data archives are a new facility 
for social science teaching and research. They developed out of the 
need by many social scientists to better exploit the masses of govern- 
mental. academic, and commercial data produced and accumulated over the 
past three decades. Their creation was made possible by the capabili- 
ties of modern computing technology. Although use of these new facili- 
ties has rarely equalled or exceeded the expectations of their creators, 
they have In their brief existence been found beneficial by thousands of 
graduate students and by hundreds of social scientists. 

Basically, exlstlny archives can be characterized according to 
the scope of their data collections and the geographic range of their 
services. There are four types of archives: 1) general-purpose, gen- 
eral-services archives such as the National Archives and Records Service 
or the Social Science Research Council Data Bank at the University of 
Essex, England; 2) special-purpose, general-service archives such as the 
International Data Service and Reference Library at the University of 
California, Berkeley, and the Roper Public Opinion Research Library at 
Williams College; 3) general-purpose, local-service archives such as the 
Data and Program Library Service at the University of Wisconsin and the 
Social Science Information Center at the University of Pittsburgh; and 
4) special-purpose, local-service archives such as the Political Behav- 
ior Archives at the University of California, Los Angeles, and the Pol- 
itical Science Research Library at Yale University. (1 ) 

Presently, most of the data stored In archives consists of 
copies of computer-usable collections prepared by various social re- 
search organizations worldwide. An Increasingly >*.1gn1f leant proportion 
of holdings Is beginning to consist of a variety of data--h1stor1cal 
and administrative records, texts of publications, speeches, and news 
reports. Statistical data, heretofore available only In printed form, 
are being converted to computer-processable form by the archives them- 
selves. 



Development of archival facilities has necessarily been slow. 
First, support has had to be foundi to establish or continue them. Fees 
charged users for services cannot yet, and perhaps never will, make 
these organizations completely self-supporting. Data archives can be 
likened to libraries, cyclotrons, and medical research facilities whose 
benefit to society as a whole often requires more, or something other, 
than a strictly conmerclal justification for continuance. 
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Second, because these facilities are new and their require- 
ments for staff are also new, they cannot usually recruit people with 
the necessary training and experience through established academic or 
other channels. The kinds of people needed are not ordinarily des- 
cribed in personnel classification systems, and in some cases descrip- 
tions cannot be made because we do not yet have enough experience to 
routimze the various duties and responsibilities. Today people who 
work in the archive organizations must usually have unique combinations 
of social research and computer skills, and have reasonably great tol- 
erance for uncertainty and ambiguity regarding their personal status and 
the future of both themselves and their organizations. 

Third, the rapid growth and change that have characterized the 
computer industry have had detrimental effects on the ability of archive 
organizations to develop with speed and efficiency. Late deliveries of 
equipment, the additional training that archival computer specialists 
must have in order to use the new computers, and the discrepancies be- 
tween promised and actual computer capabilities have all caused signifi- 
cant differences between the plans of archival organizations and any 
current ability to meet those plans. 

Finally, the development of archival organizations has been 
slowed by healthy debate among the organizations about the relative 
priorities of various ways of implementing the several goals and pur- 
poses. For example, a critical problem is to increase the use of arch- 
ival facilities already available. On the one hand, we know that there 
is a large number of potential users; on the other hand, we also know 
that these potential users confront many problems in trying to take ad- 
vantage of the services offered. In order to increase use of the ser- 
vices, should we concentrate on better ways of matching the specific 
needs of the user with the service capabilities of the archives? Or 
should we concentrate on training programs that will help users make 
local computing and other facilities more responsive to their data stor- 
age, manipulation, and analysis needs, and otherwise take better advan- 
tage of already offered archival services? 

As another example, we know that the present set of archives 
does not store certain kinds of data that some potential need. 



There is a companion problem that there is no searching f >• for 
matching data needs with the specific, current holdings arious 

data-bank organizations. Should we emphasize the develo more 

thorough coverage in data acquisitions? Or should we gi r prior- 
ity to information systems that will improve the ability $ to 

determine whether any current holdings of archives matd .mediate 

needs? 

Important breakthroughs in the ability to makt lec- 
tions more widely available to researchers and more usef tor- 

ical research are expected to come when the data-collect na- 
tions themselves give greater attention to the problems in- 
volved. There are at least three alternative sets of ext that 

can guide our approach to problems of data usability, r solu- 
tion offered by the Inter-University Consortium for Pol i arch 
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ac the University of Michigan; the second is that developed by the data 
bank of the Center for International Studies at the Massachusetts In- 
stitute of Technology; and the third has been implemented, independent- 
ly. by the Data and Program Library Service of the Social Systems Re- 
search Institute at the University of Wisconsin and the Social Science 
Information Center at the University of Pittsburgh. 

The first alternative involves a highly specialized archive 
staff that operates between the data supplier and the data user, and 
requires the Investment of considerable resources In editing and re- 
vising both data and documentation, as well as otherwise cleaning and 
preparing data and documentation for multiple use. This approach 
tends to give priority to data that are known to be of reasonably high 
quality and that have a large probability of high use rates. 

The MIT group has developed a user-oriented computer system 
called ADMINS, that provides automated aids to the user for determin- 
ing discrepancies between data and documentation or for locating in- 
consistencies within the data, thus permitting the user to move directly 
from cleaning to analysis tasks. Advantages of this system are that the 
user can change from a given level of cleaning to a given level of 
analysis at will; he is not constrained by the philosophy about cleaning 
and data preparation of any given archive organization; and he does not 
have to rely solely on the data and services available from archival 
organi zations. 

The third alternative assumes that there must be some kind of 
human interface between the data source and the user and that the user 
must have considerable control over the processing and analysis of 
archivally available data. It differs from the Consortium alternative 
in that the interface organization limits itself to finding and supply- 
ing to the user information required for cleaning data or for handling 
internal discrepancies and then systematically organizing such informa- 
tion for the benefit of other potential users. It differs from the ap- 
proach of the MIT group in that it does not limit the user to a specific 
software system for cleaning and editing the data and documentation, or 
for analyzing the information, but rather provides or adapts from other 
sources any specific items of software that the users feel they need to 
use the available data resources. The major disadvantages of this or- 
ganizational alternative seem to be that the user is dependent on the 
human skills of the people that the local data-bank organization can re- 
cruit and train, and that a given user may have to become familiar with 
several computer programs that perhaps involve very different procedures 
and assumptions; in short, the programs do not necessarily constitute a 
ay stem. 



Present information indicates that archives with high use 
rates give first priority to the various needs of their users and, in 
the process of providing service, buffer and otherwise protect their 
users from the inadequacies of the social science information system as 
it exists today. At this time, it seems a reasonable hypothesis that 
the general-purpose, local-service archives will become the major 
sources of data and services to users. Reasons for this apparent trend 



are that local users are so diverse in their needs that national organl 
zations can respond only In gross ways to their problems, and that the 
computer and other facilities available to these local users require 
special kinds of attention to almost unique local problems. 
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NEW APPROACHES TO HANDLING TECHNICAL INFORMATION 



The position from which I view and assess new approaches for 
Improving communication of scientific and technical information is 
largely that of one who Is intimately associated with the original gen-' 
eratlon of such information by means of investigation and analysis, and 
the practical application of it to large-scale engineering development 
projects. From such a vantage point, I find it most profitable to look 
upon each segment of information handling as one of several successive 
steps that should merge with one another into a closed loop wherein 
there is ample provision for continual interactive exchange with appli- 
cations of new knowledge at its core, i.e., 

Investigation 
fi 

Applications 

Dissemination «± of 

New Knowledge 

H 

Exposi tion 

When one examines the nature of the services offered by infor- 
mation centers such as the 20 centers at Oak Ridge, the fundamental di- 
lemma of technical information transfer stands out clearly. Where there 
is a confined range of focus, we can fairly readily machine compile, 
analyze, and selectively retrieve factual data. However, for broad 
fields of interest, our abilities are extremely hard put to proceed much 
beyond some systematic means that selectively guide a user to lists of 
publications that hopefully contain the information he seeks. It is my 
belief that we will never achieve much success in closing this informa- 
tion exchange gap until we institute measures that intelligently screen 
our sources of technical material to separate wheat from chaff. It is 
my intention to later propose a specific means for reducing in-flow to 
technical information centers, but since any shift in that direction 
will be difficult to accomplish, let us first examine some means that 
are being employed to cope with our present state of partial inundation. 

The Automated Library primarily serves its users by compiling 
bibliographic reference listings, organized and stored for rapid selec- 
tive search according to the nature of subject matter of publications 
contained therein. The most popular device for machine retrieval coding 
involves the use of a glossary of keywords that may be fed to the com- 
puter in various combinations to query the file. If all works well, the 
user obtains a selective listing of publications, including brief ab- 
stracts of their contents, from which he can proceed to requisition a 
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library where documents are distributed either as originally published, 
or as some sort of photoreproduction. 

The Nuclear Safety Information Center (NSIC) at Oak Ridge is 
an excellent high-quality example of how far it is practical to extend 
the Automated Library concept, when augmented by professional guides or 
evaluators. By extensive computerization, NSIC now handles 12,000 new 
accessions per year. Its personal services Include the preparation of 
state-of-the-art reports, abstracts of the nuclear safety literature, 
indexed bibliographies of accessions, retrospective bibliographies, 
special reviews, and handbooks. NSIC also operates an SDI program, an- 
swers technical inquiries, and provides technical consultation. Popu- 
larity of its more personalized services has demonstrated that scien- 
tists and engineers active in the nuclear safety field need more than 
bibliographic listings, keyword coded, as a resource for keeping up with 
technical developments. The most striking response has come from eval- 
uations made of the SDI program. This program, initiated in October 
1965, now has 1700 users. Our records show that the cost of providing 
this highly personalized current-awareness service from a biweekly scan 
is about $55 per user per year. A recent survey disclosed that a sub- 
stantial number of users: would be willing to pay a charge within that 
range. 



It should be obvious that to bridge the Great Divide separat- 
ing our comfortable bibliographic reference referral services to work- 
able means for gaining quick access to specific useful new knowledge 
will require difficult compromises of enforced selectivity. 

About three years ago, we began the development of a computer- 
ized system capable of storage and on-line retrieval of large volumes of 
unstructured technical information contained in license applications for 
U. S. nuclear power stations. The system is known as CH0RD-S, Computer 
Handling of Reactor Data for Safety. CH0RD-S experience has shown that, 
by far, the greatest amount of effort and the largest number of problems 
arise in determining which bodies of information should be considered 
important enough to be included in the data bank, and how the files 
should be structured for logically connected search relationships. A 
multihierarchic structure proved satisfactory in CH0RD-S as a framework 
for search to selectively probe different levels, but contrasting de- 
grees of detail considered necessary by different potential users of the 
system required expensive and time consuming reworking of the data bank. 
Resolving such differences of opinion is difficult but, in retrospect, 
we might observe that their existence seems to allow some latitude in 
painful discretionary choices that lie ahead for information technology. 

In spite of enormous appetites of modern automated information 
storage devices, we cannot build communications channels with capacities 
to accommodate flows that can be maintained free from sedimentary ob- 
structions. We have little choice but to introduce positive rejection 
filters and diversions into the headwaters of our information streams, 
far more effective than any employed heretofore. Of the many options 
worth exploring (any of which will likely be offensive to most of the 
technical community), we should seek devices that induce minimum 



restraint on traditional rights of self expression, yet Invoke screen- 
ing with positive incentives to encourage high-quality reporting of 
useful new knowledge. 

I would like to see a few pilot projects, of limited scope, 
having the specific objective of developing practical techniques for 
reducing representative narrative technical reports to provable useful 
facts contained therein, and ordering these facts into a pyramld-like 
structure for interactive computer storage and retrieval. Toward the 
apex of the pyramid would be concentrated the most worthwhile factual 
revelations from contributors, and successively lower layers would be 
relegated to peripheral observations that qualify, or set boundary con- 
ditions, etc., on those facts. Such a file structure can be iterative- 
ly searched to varying depths selectable by individual users of the 
system. One of the main departures of this notion from more traditional 
information system practices is the screening and ordering of subject 
matter toward the specific objective goals of potential users rather 
than attempting to digest all segments of every report published. The 
selective pyramid approach can provide an opportunity to condense and 
selectively restructure worthwhile published information without assum- 
ing the overwhelming burden of preserving every phrase of the author's 
prose. Information specialists who can perform such surgery will have 
to be knowledgeable in the technical field and be capable of facing in- 
evitable accusations of censorship. If the technique proves successful, 
however, authors of technical reports might be persuaded to participate 
later on. 



It may appear that I have ignored or neglected to recognize 
some very real software/hardware problems known to be associated with 
computer input/output manipulations. The complexity of such problems 
will vary considerably, depending on what we can accomplish toward 
screening and restructuring source material. Even so, there appear to 
be no overwhelming obstacles. Substantial reductions in the time re- 
quired for a scientist or an engineer to locate specific information he 
seeks will, in my opinion, involve further application of now well 
understood principles of interactive computer access from remote ter- 
minals, particularly those having video-screen capability. The cathode- 
ray-tube terminal, as a two-way window to the internals of a computer- 
stored data bank, is an ideal medium ?or rapid trial -and-error search 
when followed by hard-copy retention of selective readout. The cost of 
equipment to provide that capability has been increasing. There are, of 
course, many stumbling blocks along the way of adapting telecommunica- 
tions terminals to requirements for an effective information storage and 
retrieval system, particularly in the matter of matching human response 
characteristics to those of machines, and achieving component compati- 
bility for both language and speed of transmission. However, our ex- 
perience indicates that our most serious impediments to progress arise 
from the inability or unwillingness of individuals to extend themselves 
across boundaries that isolate the overspecialized disciplines of infor- 
mation technology, electronic communications engineering, and computer 
science (hardware and software). 
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KNOWLEDGE-TRANSFER TOOLS OF SCIENTISTS AND TECHNOLOGISTS 



I focus on the future of scientific and technical communica- 
tion from the operational base of Battel 1 e ' s information analysis cen- 
ters. I relate three areas--scientific and technical journals. Informal 
communication media, and critical reviews--to the acceptance of future 
knowledge-transfer systems and media. These three areas were chosen be- 
cause they are interrelated and, more importantly, they encompass many 
of the communication methods of both scientists and engineers. 

Battel le's information analysis centers have the advantage of 
operating in a research laboratory environment. They are able to con- 
tinuously observe scientists and technologists at work with information 
and data, not only in connection with the centers' activities, but also 
in the laboratory. They experiment with new systems and with refine- 
ments of old ones, and obtain the reactions of real users to these sys- 
tems. This broad spectrum of user audience provides many insights into 
the working habits of members of the scientific and technical communi- 
ties. 



The main responsibility of the information analysis centers is 
knowledge transfer. This forces the centers to be quite pragmatic in 
their approach to information systems. Sponsors become concerned if too 
large a proportion of available funding is consumed by the information 
system per se. 

The tasks of scientists and technologists in the center en- 
vironments include the preparation of state-of-the-art reports, techni- 
cal compilations, data handbooks, and answers to technical inquiries 
from Government and industry. They are faced with limited budgets and, 
frequently, with limited time scales, particularly when go— no-go de- 
cisions depend on the results of their analyses. Under these condi- 
tions, frustrations, delays, excuses, or lack of information-system re- 
sponse are not tolerable to the technologist or scientist. All systems 
must be go 1 . 



Journal Literature 



The basic functions of the journal literature, current aware- 
ness dissemination and archival storage, are well known. However, jour- 
nal literature has other roles in technology transfer that are not so 
well understood. One of the roles that cannot be filled by either in- 
formal media or by the more advanced information systems is that of 
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continuing education. Technologists and scientists call this keeping up 
or keeping abreast. Faced with the mounting tide of publication, members 
of the scientific and technological communities have been forced to be- 
come more selective in their reading habits. (1) Nonetheless, each scien- 
tist and technologist, in his individual way, continues to absorb and 
assimilate some portion of the current literature. The synergistic 
effect of this acquired knowledge should be studied in depth when con- 
sidering possible loss of our current journals or in designing new know- 
ledge-transfer systems. Certainly one has to agree with de Solla Price 
that "technology tends to accumulate more in the minds of its practi- 
tioners than it does in a well ordered literature. "(2) 

Many small firms and independent inventors do not have the 
means to tap into the large information networks, nor do they have the 
means for establishing their own information storage and retrieval sys- 
tems. This large and diversified audience must not be ignored. Charpe, 
in the Fifth Annual Lecture of the Science of Science Foundation, 
pointed out that while over 80 percent of the research and development 
dollars were spent by only 200 American firms, two-thirds of the basic 
discoveries which resulted in important innovations came from the inde- 
pendent inventors and small firms. (3) Knowledge must be made available 
to this segment of the scientific and technical (S & T) community on a 
regular basis. At present, journals represent the chief means of com- 
munication with this segment of the user audience. 

On one hand we are concerned about the tremendous growth of 
the literature and its apparent decrease in quality, while on the other 
hand we are concerned about the financial failure of journals in the 
face of economic crises. It is probably true that many journals, per- 
haps some of our most desirable journals, will fail. I would suggest, 
however, that those remaining will be of greater value and capable of 
more support from the S & T community. Competition for available page 
space will cause increased quality in those papers that are published. 



Informal Communications 



Both the information science community and some portions of 
the S & T community are studying the role of informal communications 
media in modern science and technology. Several notable experiments 
have been carried out under the auspices of the National Institutes of 
Health. (4, 5, 6) 

While not generally recognized, an alert information analysis 
center capitalizes upon the informal communications media. Instead of 
responding to inquiries by means of bibliographies, abstracts, or in- 
dexes, a professional peer relationship can be established either 
through correspondence or by means of the telephone. Through these re- 
lationships, much advance information and data are exchanged. Alert 
scientists often detect trends and needs of the user audience at an. 
early stage, thus assuring that the more formal products of the center, 
such as reports and compilations, are timely and meaningful to the user 
audience. 




- 46 - 



Recognizing that the scientist will use Informal means of ob- 
taining the information he desires, particularly when recent research 
is Involved in a rapidly moving technology, the centers make such In- 
formal communications a part of the record. Trip reports, reports of 
telephone conversations, and correspondence become an important part of 
the centers' acquisition program. 

There are, of course, limitations to the Informal media. 

Total reliance cannot be placed on Informal communications. The scien- 
tists or technologist must have the backup of a good Information facil- 
ity plus his own knowledge of the subject. But, in spite of these 
limitations, the advantages of Informal communications are obvious. 

The interchange is often enjoyable and frequently leads to a rewarding 
Interchange of ideas. Often the scientist or technologist will obtain 
new information or unpublished data. His correspondent will Identify 
key papers that will enable the scientist to avoid reading bibliogra- 
phies and stacks of papers in the hope that he may find the information 
he is seeking. Is it any wonder that informal communications are im- 
portant to the scientist? 

As our national networks become larger and more complex, the 
user audience will seek to more fully utilize his informal channels. 
User charges will accelerate this trend since direct contact with the 
user audience will be replaced by the librarian or the purchasing de- 
partment. It is my belief that the continued existence of the informal 
channels of communication should be recognized and advantage taken of 
them. The information community should learn ways of tapping these 
transfer routes, relieving some of the strain upon the more formal 
transfer methods. 



Critical Reviews 



The critical review is an old but still modern approach to the 
scientific method. Bering of the National Institutes of Health defines 
a critical review as "a synthesis of existing knowledge relevant to some 
defined problem. It examines all of the hypotheses or theories germane 
to the question in hand, the evidence which gave rise to the theories; 
it assimilates new research results and other pertinent data into these 
theories, and identifies assumptions used to support the theories. "(7) 

Without the evaluation and compression of critical reviews, 
our scientists and technologists are faced with the full impact of the 
literature explosion. Critical reviews in areas of scientific and tech- 
nological endeavor can do much to reduce the impact of the literature 
explosion. 



However, it is necessary that such reviews be written by re- 
cognized experts if they are to be authoritative. The difficulties in 
convincing such experts to undertake comprehensive review tasks are well 
known. As stated by Overhage, enough critical reviews are not being 
written "because the incentives offered for writing them are less 
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attractive to qualified scholars than the Incentives for doing original 
research. "(8) 

A modern-day counterpart of the critical review, within the 
information analysis center, is the state-of-the-art report. (9) Such 
reports may be discipline-oriented or mission-oriented. Their proper 
preparation Is a laborious and demanding process requiring the utmost of 
an individual or a team of skilled scientists or technologists. Many 
skilled Investigators do not relish such assignments and seek to avoid 
them In favor of lab-bench assignments. 

We have found that by offering adequate compensation, recog- 
nition, assistance through the center, and stressing the Importance of 
the task, many scholars can be persuaded to undertake the preparation 
of state-of-the-art reports and critical reviews. Their first effort, 
while it may be approached with reluctance, can be a rewarding exper- 
ience. If this is indeed the case, they will frequently seek additional 
opportunities for similar assignments. 

Since state-of-the-art reports are frequently prepared in a 
rapidly evolving area, they may require revision within a year or two of 
their original publication. I believe this phenomenon to be of consid- 
erable importance. Time after time the centers at Battelle will receive 
contributions of unpublished information or data only after the impor- 
tance of such information was stressed through a state-of-the-art anal- 
ysis. The decision as to whether or not the report should indeed be 
revised and republished is dependent, of course, upon a continuing need 
by the user audience. 



Conclusions 



The ingenuity of the scientist or technologist in utilizing 
the information tools available to him should not be overlooked, nor 
should his needs and desires be overlooked. Both concentrated user 
audiences and diverse, decentralized user audiences must be considered. 

The ultimate decisions regarding which of the journals will 
remain, and which media of scientific and technological communication 
might ultimately replace our present methods, rests not with information 
scientists and documental ists but with the user: the scientists and 

technologists whom we serve. They will accept or reject our solutions, 
not by direct statements, but by either a very passive acceptance or a 
lack of use, dependent upon how our solutions fulfill their needs. They 
will not be vocal in their acceptance of new services, new systems, or 
innovative methods. They will be vocal, at least among themselves, of 
their rejections. And, they will continue to develop their own paths of 
communication and exchange. 

Careful consideration of the working habits, the methodolo- 
gies, and even the peculiarities of our user audiences should be made if 
our developments are to be of real value to them. Above all, rigidity 
of system operation will hamper rather than aid the individual user. 
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INFORMATION'S FATE AND FUTURE IN THE MARKETPLACE 



If the concept of votal systems approach has anything to offer 
an innovative and creative seminar on progress In scientific publica- 
tion, it is In the suggestion that we must go far afield even to solve 
problems that seem straightforward. Increased sophistication in the 
dissemination of information and in the exponential costs associated 
with such sophistication may indeed not provide all the needed solu- 
tions. This is not to argue against refinements, even costly refine- 
ments in the field of providing information, but if we are co achieve 
truly significant breakthroughs, we must look at the larger picture. 

The larger picture is that we are not dealing with scientific 
publication per se, but with the present and future generation of sup- 
erior social, commercial, and military technologies. Our goal should 
be one of accelerating the generation of such technologies. Growth, 
narrowly defined as increased real income per capita, depends on tech- 
nological progress and on those factors that promote or obstruct it.(l) 

The great bulk of our usual concern regarding scientific and 
technical information has had to do with the supply side, the problem 
of communicating-out. I submit that we should put our focus on the de- 
mand side, not only for the purpose of determining who are the users, 
but for the purpose of energizing the communicating-in of scientific 
and technical information. Determining who, in fact, are the users or 
potential users becomes a clue to the next, more important step of get- 
ting them into action and keeping them in action. 

The departmentalization, stratification, and institutionaliza- 
tion of industry, government, and the university mask the fact that the 
user of scientific and technical information is not a consumer at all 
but a type of entrepreneur. The distinction is between an essentially 
passive role and an aggressive, self-actuating role motivated by the 
implicit or explicit wish for either wealth or recognition. 

Hooking the communicating-in to the communicating-out and 

profitably exploiting this interrelationship is technology transfer, a 
term that includes both a supply and a demand concept, which I find more 
meaningful than the one-sided, supply-oriented dissemination and commun- 
ication of information. It is of course obvious that "there must be a 
far better climate, more receptivity to innovation and change, if there 
is to be more effective transfer. " (2) Undeniably, the cost of pursuing 
technological change is extremely high both in resources and in altered 
social values but, as Prof. Di Gregori of the University of Houston is 
fond of saying, the cost of not pursuing technological change is even 
higher. He cites the last 200 years of Chinese history as an example. 
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China was the only nation in the 18th Century capable of em.n ting 
England's industrial revolution, but did not because of t 1 jh cost in 
terms of her values. 



If we do not cease to be a pragmatic nation, we 
continue to devise, however painfully, new institutions to 
technological innovation even at the sacrifice of existing 
and existing social values. We must and will do whatever 
to energize the forces that provide technological innovati 



! 1 somehow 
vil itate 
_ t tutions 
" r essary 



Resistances to specific innovations (and resistances to a 
social mechanism that could greatly accelerate the establishment of 
radical new technologies) really do not concern themselves with the mat- 
ter of technological usefulness. As Barton Hacker points out, contro- 
versy (and resistance) arises in the perception of an innovation's 
social consequence (or the social consequences of a mechanism that could 
create a stream of innovations) . (3) The actual utility of an innovation 
(or an innovation-creating mechanism) plays a decidedly secondary role. 

A case in point is the defeat in Congress of the Department of Com- 
merce's 1963 Civilian Industrial Technology program. That proposal was 
designed to encourage and support additional research and development in 
industries that the Department regarded as technologically lagging. 
Industrial groups successfully opposed the bill because, with good rea- 
son, they feared that government sponsorship would upset existing com- 
petitive relationships. (4) 



New technology that would rapidly erode asset values will also 
be resisted and delayed. I do not claim that this resistance need be a 
conscious conspiracy; It's just that laws, values, the tacit understand- 
ings of enlightened self Interest, all provide the mechanism for much 
technological resistance. Effective resistances to technological inno- 
vation may also come from within the ranks of fundamental and applied 
science. A vested Interest in an idea can afford resistance to change 
just as effectively as a vested Interest in potentially jeopardized 
Income- producing property. 



The balance of this paper Is an attempt to sketch the outlines 
of a new Institution which, while tending In some ways to conflict with 
established values of our capitalist society, does offer a hope for an 
order-of-magnl tude Increase In our capacity to create new technologies. 
As a derived demand there can be expected to be a heightened applica- 
tion of scientific and technological Information. 

Tne Technological Risk Assumption Corporation (TRAC) Is vis- 
ualized, In Its simplest sense, as an Institution that guarantees all 
funds borrowed for Initiating and maintaining new science-based compan- 
ies. The borrowers are primarily scientists and engineers but may also 
Include specialized marketing and financial personnel as needed. Be- 
cause the TRACs must be heavily Involved In making short- and long-term 
forecasts of all kinds (technological, market, financial, etc.), I* 
appears advisable that they be departments or subsidiaries of estab- 
lished Insurance companies, and staffed with expert personnel In various 
fields Including scientists and engineers specializing In several 



specific areas or industries. Because of the great risks involved and 
as a consequence of the great potential national benefits, the ultimate 
guarantor underwriting losses of the TRACs must be the Federal govern- 
ment. TRACs would operate as a federally regulated industry with a 
guaranteed return on the quantity of funds that are guaranteed. 

The TRAC system would obviously be costly to the government, 
but this cost must be measured against the social benefits that can 
be expected through energizing the creation of science-based companies 
and industries. The TRAC system is fully harmonious with a continuation 
of the present venture-capital financing of new industry. Private 
capital could also invest in the science-based companies and assume 
bank loans after early critical uncertainties disappeared. The TRAC 
concept could also be employed in the expansion of small existing 
science-based companies. 

Though the TRAC concept might be attacked as negating capi- 
talism, it should be noted that it facilitates the formation of large 
numbers of new scientific ai.d technological entrepreneurial enter- 
prises and stimulates the creation of new technologies. In so doing 
it provides a derived demand to creativity, to research and development, 
and to broador and more effective use of the scientific information 
with which this conference is concerned. 
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ACTIVITIES TO ENHANCE COMMUNICATION IN THE AERA 



General Information about AERA 



The American Educational Research Association (AERA) is the 
largest and most representative association for the scholars and re- 
searchers concerned with educational Issues and problems. Almost 70 
percent of the nearly 10,000 members are academicians, the three larg- 
est subcategories being educationists, psychologists, and other social 
scientists. 

The rapid growth from slightly more than 3,000 members In 1964 
to the present number has Involved concomitant changes In member char- 
acteristics. Although the percentage of representation from public 
schools has held constant over this period (about 12-14 percent) the 
generalists, such as assistant superintendents of Instruction or curri- 
culum coordinators, have been replaced by specialists In research and 
evaluation. There also has been an Increase In the number of repre- 
sentatives from profit and nonprofit businesses and Industries. 



Evolution of the Formal Channels of Communication 



In the last five years, AERA has altered several components In 
the three bases of Its formal cormunl cations network: publications, the 
annual meeting, and training programs. 

Since 1964, AERA has added to Its publications the American 
Educational Research Journal (a scientific journal), a Handbook of Re- 
search on Teaching, a Curriculum Evaluation Monograph Series, and a 
seven volume series of readings. In the planning stages are an Annual 
Review of Educational Research and a quarterly publication patterned 
after Psychological Bulletin . 

Thus, the Association Is vigorously extending the network for 
data transmission, review, and assimilation Into theories for both re- 
searchers and practitioners. 

Since 1964, the Association's annual meeting has shifted from 
an aggregate of scientific papers read to report research In progress, 
to a meeting at which, In addition to the above function, leading scho- 
lars and researchers In education can comment upon the direction and 
quality of research and Its relevance to the major issues facing the 
educational enterprise. 
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In the past four years, AERA has operated thirty- five 5-day 
training sessions on such topics as multivariate analysis, Bayesian 
statistics, anthropological field study methods, and evaluation of in- 
structional materials. One year more than 750 persons were enrolled in 
eleven sessions. Moreover, a task force funded by a grant from the 
U. S. Office of Education is presently surveying needed kinds of train- 
ing and designing a variety of programs appropriate to AERA support. 



Studies of the Communication Networks 



In an effort to understand better the effectiveness of these 
changes, and to postulate other needed improvements, AERA has sponsored 
several studies and conferences. 

The most extensive study has been conducted by Johns Hopkins 
University Center for Research in Scientific Communication (CRSC) and 
has been described in two preliminary reports. (1) Or. William Garvey 
and his associates have plotted a flow chart of the process by which 
the typical educational researcher exchanges information with hi S' col- 
leagues. The information exchange that takes place at the national 
meeting was analyzed, as were the dissemination processes which use the 
scientific journal article as critical medium. The resulting data were 
compared with Information exchange activities of educational researchers 
In other disciplines. (2) 

These studies have led to several modifications of the annual 
meeting, the most Important being the distribution of the abstracts of 
papers as early as possible prior to the meeting Itself. Other changes 
were made to facilitate the Informal exchange of information. 

The second study, a survey of the readership of the Review of 
Educational Research, was conducted by Linda Harris of the Institute for 
Communication Research at Stanford University. These data led to the 
conclusion that (a) an annual review of educational research was needed, 
and (b) educational researchers could benefit from a journal that pub- 
lished unsolicited reviews. 

The need to understand better the flow of informal communica- 
tion led to a study of invisible colleges. This investigation by David 
Lingwood at the Institute for Communication Research at Stanford Uni- 
versity used sociogrammatic data to compare educational researchers to 
attitude researchers. While the study revealed some Interesting aspects 
of the behavior of leaders, It did not conclusively establish that there 
are any Invisible colleges among educational researchers. 

In November 1968, AERA assembled a group of sociologists of 
science, communication researchers , association executives, and educa- 
tional researchers to examine In what manner the characteristics of 
social systems In the behavioral sciences Influence the direction and 
the quality of research. The purpose of this Col loqulum was to deter- 
mine what an organization such as AERA could do toward more effective 



methods of organization of the society of scholars in education. The 
papers read at the Colloquium, a compilation of interviews with leading 
sociologists, and a recapitulation of the discussion will appear in a 
final report. In progress is an evaluation of the Colloquium's impact 
on AERA and its affairs conducted by staff members of the Bureau of 
Applied Social Research, Columbia University. 



Problems and Frustrations--A Summary 



The American Educational Research Association has shown re- 
markable vigor in expanding the formal communication network to better 
serve its constituency. It has done so, however, in largely tradi- 
tional ways: for example, by adding new publications, by allowing its 

regular publications to expand in size and number. (The technical 
journal has doubled its pages per issue in three years; and the offi- 
cial newsletter, the Educational Researcher, has increased from 49 pages 
in 1965 to 114 pages in 1968.) 

All this has raised some serious financial questions. While 
it has been easy to sell almost any publication in education, continued 
increases in printing costs, coupled with the fact that members are no 
longer willing to donate their services, makes it problematical that 
AERA will be able to continue to expand i ts‘ publ ication program--or even 
maintain what It now has. 

Realizing this, the Executive Officer and the Executive Board 
have tried for more than two years to locate a researcher who would 
be willing to attempt a comprehensive study of the complete communica- 
tions network. Such a study would furnish the Association with requi- 
site Information to explore Installation of the latest technological 
developments and Innovations In the storage, retrieval, and dissemina- 
tion of technical Information. The Executive Officer continues to seek 
a research unit to undertake this study. 
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DEVELOPING SOFTWARE INTERFACES FOR INFORMATION PROCESSING 



The role of information processing in the development of civ- 
ilization is central, ancient, and obvious. If the term information 
processing is understood as a generic descriptor for the principal sub- 
processes of storage, transmission, and retrieval of information, then 
it is clear that there have been just four major advances in information 
processing capability in the past 6000 years. They are: 1) the inven- 

tion of writing systems in the Near East in the fourth millenium B.C.; 

2) the invention of alphabetic writing systems by the Greeks about 800 
B.C.; 3) the invention of movable type and its application to printing 
about 1450 A.D.; and 4) the invention of high-speed, general-purpose 
digital computers about 1950 A. 0.(1) 

Each of these Inventions provided order-of-magnitude. increases 
in the capabilities of their users to store, transmit, and retrieve In- 
formation. Each of the first three was directly followed by major 
changes In the nature and capabilities of the societies that exploited 
them. Although It Is too early to assess accurately the Impact of com- 
puters ! n this sense, historic comparisons suggest that the magnitude 
of the computer's effect will still be greater since It provides greater 
Increases In Information processing efficiency. 

Each of the problems of Information storage, transmission, and 
retrieval has Its engineering, economic, and algorithmic coding aspects. 
R & D Consultants work In the latter two areas. 

The problem of storage Includes questions related to the ways 
In which natural languages code Information. Since the redundancy of 
English Is approximately 75 percent, It Is evident that there are sig- 
nificant economies that are theoretically attainable In the storage of 
natural text. We have studied this problem In several forms(2, 3) arid 
we are currently preparing a report on subject heading compression for 
the Library of Congress. Preliminary results indicate that elementary 
algorithms that take advantage of the affix-kernel structure of Eng- 
11sh(4, 5) can provide substantial word compression without any Infor- 
mation loss. 

Information stored in the form of printed text must be com- 
posed. Traditional composition techniques include certain conventional 
modes of representation that are primarily of an esthetic nature al- 
though there Is some evidence that they also Improve the reader's rate 
of retrieval. With the advent of high-speed composing devices, It has 
become necessary to develop methods for automatically performing these 
operations. Some of them, such as page formatting, require Information 
Inputs in addition to the text Itself, and these tend to admit a 
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relatively routine automation, although the matter of efficiency is cri- 
tical and often depends primarily on the nature of the programming lan- 
guage used to implement the algorithms. This in turn leads to problems 
of computer language design for t»xt processing problems. R & 0 Con- 
sultants is currently testing an experimental programming language which 
appears to have many promising features for such text processing prob- 
lems. 



Certain aspects of text composition appear to require addi- 
tional information other than the text itself, but careful analysis 
shows that the hypothecated additional information is actually already 
present in the text and can be unraveled in an algorithmic way. For 
example, the composition problem of hyphenating and justifying .ext 
lines can be solved without recourse to dictionary look-up procedures to 
provide a process that is both more accurate and less expensive than 
human efforts. Such a system, developed by R & 0 personnel, is incor- 
porated in the Mergenthaler Linotron. 

We think that many of the problems of error detection in text 
can be solved by algorithmic procedures of a similar nature. Certainly 
an economically significant portion of the spelling errors, as well as 
certain classes of format perversions can be readily detected, and in 
most cases corrected by algorithmic means. Much remains to be accom- 
plished in this area. 

Another Important problem of this type, and one that we think 
is accessible to existing technology, concerns composition of formulae 
from typescript (OCR) or typewriter-generated machinable Input. Apart 
from Its direct economic advantages, a solution would substantially de- 
crease the considerable delay between submission of a scientific manu- 
script and Its appearance In a permanently available form. 

Problems of Information retrieval have rightly commanded con- 
siderable attention In recent years. The exponential growth of Infor- 
mation archives (documented, for Instance, In Refs. 6, 7) creates Ir- 
resistible pressures against static access techniques. We have been 
particularly concerned with problems of measuring and Increasing access 
to archival collections, and to this end have undertaken studies of 
computerized library catalogs(6) the structure of bibl1ographies(6) , and 
the automatic creation of Indexes. The latter Is in many respects the 
most exciting and promising of the access-expanding techniques that we 
have Investigated. Our preliminary studies have shown that human-pro- 
duced, back-of-the-book type Indexes display a consistent structural 
pattern. The distribution of text references associated with the Index 
terms Is essentially exponential (8), and the grammatical structure of 
the Index entries themselves shows regularities which are of Importance 
when Indexing algorithms are considered. (9) Based on a number of stud- 
ies conducted during the past several years, an elementary Indexing al- 
gorithm has been designed, and applied to the text of a book. (6) The 
Index that appears In Reference 6 has received minor human post editing, 
as Is therein described. In all of Its measurable characteristics, this 
experimental Index Is consistent with human-produced Indexes. 
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Very little information has been published about the statis- 
tical and structural characteristics of indexes, although measurement 
of the performance of an algorithmic indexing procedure must evidently 
be referred to some standard. In order to remedy this omission, we 
have begun to collect and analyze the indexes occurring in a random sam- 
ple of some 1800 monographs drawn from a university library of medium 
size. Upon completion of this project it will be possible to assign a 
degree of certainty to statements about the nature of indexes, and also 
to provide meaningful comparisons of performance of various indexing 
procedures. 

It is important to recognize that an index to a text is a 
global function of the text. This means that it depends on the text 
considered as a whole, rather than on consideration of consecutive local 
portions of it with simple accumulation of the locally determined index 
terms. From this point of view, indexing algorithms contrast sharply 
with editing or hyphenation or page composition algorithms which rely on 
relatively small amounts of context in order to make their decisions. 

One consequence of this difference is that indexes based on literature 
occurring in a specified field of Interest can be cumulatively recom- 
piled (from a greatly compressed subset of the original texts) to pro- 
duce a cumulative Index which Is not simply the union of the Indexes of 
the papers and monographs that constitute the literature base. Cur- 
rently, such cumulative indexes are not published. However, with the 
availability of high performance and low cost automatic Indexing tech- 
niques, It will no doubt become desirable to construct and publish such 
cumulations on a regular basis for literature In Important and rapidly 
changing fields. 

There Is another class of retrieval problems that demands at- 
tention. In order to be effective In typical environments, a retrieval 
system should admit Incorrect Input requests; that Is, It should have 
some degree of stability with respect to the knowledge of the system's 
user. An elementary but Important example of the need for stability Is 
provided by Incorrectly spelled author's names given as retrieval In- 
puts; another relates to spelling errors In author or title designations 
that are due to mechanical errors rather than wrong or Incomplete Infor- 
mation. Indeed, It Is not too much to hope that error correction tech- 
niques of a linguistic nature could be Included In computer operating 
systems to assist In program debugging In a positive manner, but this 
will require a redundancy Increase in the format of most higher level 
software systems. 
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THE USER OF TECHNOLOGICAL INFORMATION, TARGET OR PARTICIPANT? 



The Denver Research Institute's Project for the Analysis of 
Technology Transfer has been studying the means by which scientists and 
engineers acquire information on new technology from outside their or- 
ganizations, and the implications of their behavior for technology dis- 
seminators. (1) 

The Investigation has concentrated on those knowledgeable 
senior people, often found In research and engineering groups, who 
serve as mediators between their group, Its Information needs, and the 
various external sources of Information. In certain Industrial firms, 
we found mediators In three functionally different categories: In 

research-oriented, product-oriented, and technical management groups. 

The research-oriented mediators were usually concerned with natural 
phenomena and analytic problems. They usually sought Information about 
basic scientific knowledge, analytic techniques, or new products, mate- 
rials, or components. Product-oriented mediators principally dealt 
with the application of currently or soon-to-be-available materials and 
components for the development, design, and production of a product. 

They often sought Information on design concepts, technological per- 
formance, new applications, and new products, materials, or components. 
Technical managers were not primarily occupied in technical work. They 
typically needed Information relating technological capabilities to 
market opportunities, or on the economic Implications of substituting 
materials or components. 

Charts I, II, and III summarize the rank ordering by respond- 
ents to 17 external channels of Information for two types of Informa- 
tion use, current awareness and problem-solving activities. 

The respondents were drawn from four manufacturing Industries 
with little defense/space orientation (l.e., batteries, medical elec- 
tronics, Industrial controls, and printing and reproduction machinery) 
and one technically oriented service (vocational technical education). 

We were Interested to see the Importance attributed by people In these 
firms to textbooks and handbooks for problem solving and to trade pub- 
lications for maintaining awareness. The charts suggest the crucial 
role of Journals for research-oriented personnel, and the Importance of 
supplier channels (both supplier personnel and supplier catalogs) for 
product-oriented and technical management people. 

! 

Ready access and faml 1 1 arl ty wl th a given channel cdrrelated 
directly with the Importance attributed to It. For example, the rela- 
tively new Federal government channels for reporting government-support- 
ed research were rated comparatively low. 
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~tHART I. IMPORTANCE RANKING OF EXTERNAL INFORMATION 
CHANNELS BY RES E ARCH -OR I E NT E D PERSONNEL 



FOR AWARENESS 
70 60 SO 40 30 20 10 0 




70 60 60 40 30 t' 10 0 

N * 110 



Professional journals 
Trade Publications 
Meetings 

Supplier personnel 
Vendor catalogs 

TextbooVs 

University and other consultants 
Customer personnel 

Government publications 
Libraries 

Abstracting services 
Patents 

formal courses 
Associations 
Mass media 
Clipping services 
fclssemlnitlon centers 
Other outside channels 



FOR PROBLEM SOLVING 
0 10 20 30 40 50 60 70 




0 10 20 30 40 50 60 70 

N • 1 1 7 



o 

ERJC 



64 






CHART II. IMPORTANCE RANKING OF EXTERNAL INFORMATION 
CHANNELS BY PRODUCT 'ORIENTED PERSONNEL 
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CHART III. IMPORTANCE RANKING OP EXTERNAL INFORMATION 
CHANNELS BY TECHNICAL MANAGEMENT PERSONNEL 
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Each of the mediators tends to tap many and varied channels. 
Some channels complement each other and add to each other's value. 

Some redundancy appears desirable; it Is probably Important for effec- 
tive dissemination, particularly If dissemination Is directed at more 
than one functional type of R & D mediator. 

Our study has led us to raise the following question: Should 
there be major concern with the technology dissemination function, or 
should the term dissemination be de-emphasized In favor of the term 
oormmioation and the concepts the latter fosters? Communication re- 
quires the participation of the user. Those responsible for improving 
technological communication may need to consider ever more active Inter- 
vention with the user or with the user's environment. The success of 
technological communication systems depends at least as much on the 
users as on the transmitters of Information. Therefore, It seems logi- 
cal that those seeking means for progress in this field should seek pos- 
sible changes in the user situation, as well as proposing possible 
changes in publication and dissemination. 
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INFORMATION SERVICES FOR INDUSTRY 



The fast changing and burgeoning character of the Industry of 
our State Is creating great technological and managerial needs. It has 
long been recognized that the School of Engineering has technical capa- 
bilities which, when properly channeled, can provide a vital assist in 
meeting those needs. We are now doing just that through the field work 
of the Industrial Extension Service and through the offering of timely 
short courses, conferences, and workshops using School faculty and ap- 
propriate industry personnel. 

At present, the Industrial Extension Service (IES) and our 
State Technical Services (STS) Program have a combined staff, consisting 
of 23 professional personnel representing eight engineering disciplines. 
The Field Services section provides for industry liaison, referral ser- 
vices, problem-solving, technical assistance to small Industry, and con- 
ducts most of the in-plant training courses. The extension faculty are 
responsible for the development and much of the presentation of the edu- 
cational programs. Both of these groups are located on the campus of 
North Carolina State University. Participation in the STS Program by 
other North Carolina institutions includes the computerized literature 
searching services of the North Carolina Board of Science and Technology 
and educational programs by Wake Forest University in Winston-Salem, 

East Carolina University at Greenville, and the Research Triangle Insti- 
tute at Durham. 

The IES provides the following types of information services: 

LIBRARY SEARCHES - identifying and making available 
book and articlo references on particular subjects 
or problems. 

DIRECTORIES - providing readily available informa- 
tion on individuals and organizations rendering in- 
dustry services as well as the location and avail- 
ability of North Carolina produced parts and 
materials. 

BULLETINS AND PAMPHLETS - containing information 
easy to understand and apply on numerous subjects 
of interest to small industries. 
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HANDBOOK LIBRARY - containing a wide range of hand- 
books In technical and managerial areas that are on 
loan free. 

REPRINTS - Including key comprehensive articles of 
general interest and detailed articles of specific 
technical interest. 

IES NEWSLETTER - providing current information 
month 1 y on the extension activities of North 
Carolina State University as well as other articles 
of interest to North Carolina industry. 

INFORMATION SERVICES - The information services 
program of the Industrial Extension Service seeks 
to make available to industry valuable sources and 
aids found in the technical literature. This is 
accomplished through the library and various kinds 
of publications. 

Various programs designed to fill the continuing 
education needs of industry are offered through personal in- 
struction, television and telephone media, .programmed instruc- 
tion materials, and a film library. 

SHORT COURSES AND CONFERENCES - Short courses and 
conferences are conducted on campus and around the 
State for updating and training of industrial per- 
sonnel. Courses are given in subjects as diverse 
as work simplification, concrete technology, con- 
struction management, fluidics, and critical path 
scheduling. New courses can be developed on re- 
quest. 

EDUCATIONAL TELEVISION COURSES - Educational Tele- 
vision courses are offered during late afternoon 
hours via the State's five-channel open-circuit 
educational television network. 

IN-PLANT TRAINING COURSES AND MATERIALS - In-plant 
training courses and materials are available on 
supervisory and technical topics. Courses are con- 
ducted by extension staff members on supervisory, 
managerial and technical topics. 

TELEPHONE-TRANSMITTED COURSES - Telephone-trans- 
mitted courses via the Eleotrwriter system ®re 
available to provide in-plant instruction without 
the necessity of frequent travel for either the 
student or the instructor. 
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FILM LIBRARY - The film library contains over 150 
training films on the general topics of supervi- 
sion and human relations, industrial econorny, in- 
dustrial engineering, industrial safety, and 
methods Improvement. 

LIBRARY OF PROGRAMMED INSTRUCTION MATERIAL (LOPIM) 

- Over 100 programmed Instruction (self-teaching) 
courses are available on such topics as account- 
ing, computers, electronics, quality and cost con- 
trol, machine control, management, and statistics. 

CORRESPONDENCE AND EVENING COURSES - Correspondence 
and evening courses for college credit are offered 
each semester. 



Engineering Assistance 



A staff of field representatives and extension specialists is 
available to help small industrial firms lacking engineering specialists 
to identify problems and to assist in solving them until the firms or 
some other source can reasonably complete the task. 

Also emphasized is the location and application of new techno- 
logical ideas, and the compilation and presentation of technical data 
on materials, products and processes which present opportunities for 
enterprise expansion or diversification. 

Sometimes assistance is provided by a combination of instruc- 
tion and demonstration. For certain common areas of need, an Industrial 
Extension Service staff member can be assigned to work with members of 
a firm's staff to provide on-the-job training based on the firm's opera- 
tions and problems. For this, the industry students are supplied with 
available text and training materials. 

In cases where the Industrial Extension Service is not able to 
provide complete assistance, it serves as a liaison between the request- 
ing firm and faculty members, professional consultants, state and fed- 
eral agencies, and other sources where the needed assistance can be ob- 
tained. 

Representative areas in which assistance has been provided 

are: 



• Feasibility studies for the manufacture of new products 

• Plant layout and equipment requirements studies 

• Liaison service in locating professional and technical 
expertise 
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• Manufacturing process trouble-shooting 

• Verification of product cost estimating procedures 

• Location of suppliers of equipment and services 

• Development of quality control, Inventory control, and 
production control procedures 

(i Methods Improvement analysis 
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MUSEUM DATA AS AN INFORMATION RESOURCE 



As people learn to make use of the information stores in li- 
braries, archives, government collections, museums, and other reposi- 
tories, today's needs for computerization and for a better understanding 
of how to take advantage of computerized data bases can only become mag- 
nified. The Museum Computer Network research project described here is 
one effort to explore the use of new tools and techniques that will make 
it possible for these repositories to cope with predictable demands. 

Fifteen museums in New York City and the National Gallery of 
Art formed a consortium in 1967 to examine the feasibility of a com- 
puterized catalog of their own and other museums' combined collections, 
and to implement a trial data bank. 

Written records of museum holdings are of two kinds, although 
the distinction is blurred in practice. Accession records consist of 
an object's identification number, data concerning its source, method of 
acquisition, physical description, and the like. Since no research or 
special knowledge goes into their making, accession records exist, at 
least in theory, for all museum objects. Catalog records contain data 
verified by professional research as to authenticity, provenance, his- 
tory, and bibliography. Only a small portion of museum holdings are 
fully and finally cataloged. 

Museum records are anything but standard. They vary in types 
and amount of information, how it is subdivided and labeled, the degree 
and method of indexing, vocabulary used, and format. Every kind of 
variation may be found, not only between museums, but within the indi- 
vidual department and the individual file. Most variations are random, 
but many embody policy, tradition, or opinion, and so must be kept. 
Underlying this diversity is a remarkable uniformity of factual content. 
The possibility of an organized data bank derives from the presence of 
comparable data, implicit or explicit, in nearly all records. The ex- 
traction of these data requires logical analysis, much of which can 
eventually be programmed but all of which is now done by people. 

Further, most museums own materials outside their field of 
interest, and these orphan collections are hardest of all to find and 
least likely to be cataloged. A general data bank could bring them into 
meaningful context and lead to advantageous exchanges, sometimes of ob- 
jects and sometimes of personnel able to catalog them. 

The files generated to date employ programs that were con- 
ceived for organizing, storing, and retrieving data of variable length. 



The programs have the general name, GRIPHOS (General Retrieval of Infor- 
mation Processing for Humanities Oriented Studies). 

The chief file is the Item file that consists of a random se- 
quence of items of any length, each describing one entity, i.e., an 
object, person, document, or event. Items in turn are divided Into any 
number of .units each of which conveys one logical fact or a cluster of 
related facts. The unit consists of an annotator (a code number), show- 
ing its significance, associated with a word, phrase, number, or para- 
graph of factual content. The unit may be considered a statement about 
the subject of the item. The units are of two kinds. The shorter, 
stating one fact only, and likely to be repeated in more than one item, 
are called descriptors. These not only convey information, but also 
serve as an index to the file. They alone are the subject of search and 
sort operations. 

The other units, knowti as free text , form a reservoir of sup- 
plementary information. They are generally longer and often unique to 
a single item. This, together with the conjunction of several facts in 
a single unit, makes free text unsuitable for routine searching and 
sorting. It would have been easy to combine all the free text in any 
one item into a single abstract, but users would too often be burdened 
with excess information. The text is therefore divided into as many 
units as necessary, each of which is assigned a code number consecutive 
with that of a related descriptor. For example, code #2 identifies the 
name of an object's owner (a descriptor) while #3 tells about the ac- 
quisition of the object by that owner (free text). 

Perhaps the most important, if quite imprecise, datum we have 
come up with is that it costs, all told and on the average, about $1.50 
to see an art museum's object record through, from card files to elec- 
tronic storage. This, assuming access to some nonprofit computing fa- 
cility, is probably a maximum figure. Individual items vary in length 
from almost nothing to hundreds of lines, but the average for records of 
this type seems to be 20 units and roughly 400 characters. 

Some problems might arise from museums' and individuals' 
rights to information. Where files represent a life work, the proprie- 
tor will hardly care to see their content rushed into print by a user of 
a data bank. There is also simple pride; museum publications set high 
standards of style and accuracy. The labor needed to maintain files 
helps to explain why relatively little goes to press. But if making 
data public through the Network is regarded as a form of publication, 
the same obstacle will arise, and the mass of semi-cataloged material 
will not be any more accessible than it is today. 

Some formula is needed to assure free circulation at least of 
basic inventory information (accession records) without researchers being 
deprived of credit or profit, and without institutions feeling their 
reputations threatened by every dubious descriptor. The question of 
organization beyond our experimental phase remains open. 

It should be noted that a network such as this lacks natural 
limits, geographic, linguistic, or disciplinary. The problems of museum 
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record keeping are those of record keeping In general, and the solutions 
now taking shape should apply as well to any subject matter. Indeed to 
any mixture of subjects. Machine storage capacity Is growing faster 
than data can be gathered. There Is literally no stopping place In the 
whole universe of recordable information. 
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CURRENT ECONOMICS OF PRIMARY JOURNALS IN 
BASIC SCIENCE AND TECHNOLOGY 

I present here a general philosophy for approaching problems 
of policy for the scientific and technical literature; data from a brief 
survey of the economic situation and problems of primary journals, and 
an attempt to construct arguments for or against certain policies by 
combining the philosophy with the data. 



Philosophy 



At the outset we must recognize that the value of primary 
journals Is measured by what they contribute to the progress of science 
and technology as a whole. Thus to make recommendations about their 
economics one should ask two interrelated questions: 1) If the nation 

{or the world) devotes a given amount of money and manpower to a certain 
broad area of science or technology, what proportion of these resources 
should go Into primary journal publication In order to achieve the most 
progress? 2) Given this proportion, what policies controllable by the 
publisher of journals, and what policies of government agencies and 
other sponsors of R & D toward these publishers, will yield the most 
ultimate progress? In all cases, the yardstick Is the health and pro* 
gress of the entire research or development effort, not the mere pro- 
vision of a publication outlet for a given number of papers, i H is 
our first basic principle. 

Who Is to be the judge of what contributes, and hov. to 

this health and progress ? The various entitles that now make ons 

about Information transfer--ind1viduals, university department 
libraries, companies doing research and development, scientific 
cletles, private publishers, government agencies, etc. --base these de- 
cisions on perspectives of varying breadth and varying range In time, 
and on motives that Involve altruism and self-interest In varying pro- 
portions. We believe that the diversity of viewpoints, motivations, and 
above all the expertise among these entitles Is a resource that can, If 
Intelligently used, contribute more to the health and usefulness of pri- 
mary publications than could an extreme centralization of decision 
making. Our second basic principle is therefore that those entities 
that combine a considerable economic or administrative power with some 
of the broadest and most long-range points of v1ew--0ST, NSF, many 
scientific and technical societies, etc. --should set policies that en- 
courage Innovation and freedom of choice by Individuals and smaller 
groups, and constrain these activities only to the extent necessary to 
ensure that the general direction of their choices is not antisocial. 
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Several rather general corollaries can be drawn from these two 
principles. The first of these has to do with allowing the users of in- 
formation to be the principal judges of its value to them, as compared 
with other things that society could provide them at the same cost. So 
far as is feasible in our present free-market society, whenever any in- 
dividuals or small groups are willing to pay a cost at least as great as 
the incremental cost of providing them with an information service (such 
as a subscription to a journal), they should be able to obtain this ser- 
vice. 



Two more corollaries have to do with the economic importance 
of items that do not appear In the financial records of journals, but 
that still represent costs to society for the complete process of get- 
ting information from its producers to Its users. One such item is the 
time spent by users. We must always keep In mind the value of the time 
spent by the users of journals in finding material in them and in read- 
ing and digesting it, and of the time that potential users may waste 
through failing to make contact with information that could help them. 
Another item that should not be forgotten Is t.he hidden cost In journal 
production, the value of the time spent by referees and editors who are 
unpaid or Inadequately paid, and of the services effectively donated by 
the employers of these referees and editors. No comparison of alterna- 
tives for journal production is valid without consideration of these 
hidden costs. 



Data 



Our brief survey assembled data on size, price, birth and mor- 
tality, etc. (the things Identifiable by examination of issues in a 
library) for about 250 U. S. journals In all fields (preponderantly the 
physical sciences and engineering), and a considerably smaller number of 
foreign journals. Data on circulation and on production costs and in- 
come were obtained for about 45 U. S. journals by correspondence with 
the publishers. 

The diversity of the journal population Is overwhelming. In 
bulk, the largest journals publish about 500 times as much per year as 
the smallest. The average journal has been increasing In bulk at about 
7 percent per year. The spread in price Is also enormous, the best buy 
in 1968 having provided 90 times as many words per penny as the taorsfc 
buy In our sample. A very Interesting fact Is that, whereas journals 
without page charges have usually been getting more expensive per word 
over the last decade, those with page charges have usually been getting 
cheaper. Circulations also vary widely, though few journals he out- 
side the range 1500 to 15,000. 

New journals are being born at a considerable rate and, In the 
U. S. at least, the mortality of journals of the type considered here Is 
negligible. As a result, the number of such U. S. journals Is Increas- 
ing at something like 6 percent a year. 
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Few fields are so specialized that a worker can rely on a 
single journal or even half a dozen for as much as two-thirds or 
three-fourths of his useful Information. The rate of obsolescence of 
material published In primary journals Is slower than Is sometimes 
supposed. 



Arguments and Admonitions 



An oversimplified application of the philosophy outlined 
earlier would be to say that the support of journals should be such that 
they can be sold at prices that will maximize the net benefit to society 
from decisions of enlightened buyers as to whether or not they should 
subscribe to the journals. In the real world, we desire, as a second 
goal, that the support policy be such as to encourage correct decisions 
on the whole range of factors having to do with the birth, death, and 
quality of primary journals. Finally, as a third goal, we desire that 
the journals have reasonable economic stability In the face of temporary 
fluctuations In conditions. 

A detailed analysis of each of the various combinations of 
pressures and decisions shows that for many of the combinations, social- 
ly beneficial decisions are encouraged by having a page charge or ether 
subsidy of pre-run costs, while for other combinations, there can be an 
advantage to having a major part of pre-run costs covered by subscrip- 
tion Income. Thus, the optimum support policy for encouraging good 
management Is a little uncertain. 

Since a large majority of the papers published In practically 
all fields of science and technology are supported by the federal gov- 
ernment or by Industry, we draw the conclusion that the page-charge sys- 
tem be widely adopted for journals published by nonprofit publishers. 

All government contracts for research or development should budget rea- 
sonable amounts for possible publication expenses, and these amounts 
should not be transferrable to other uses. If the necessary adnlnls* 
tratlve machinery can be set up, page charges shouid be paid directly 
to the journals by the sponsoring agencies, without passing through the 
nands of the contracting Institutions. The guidelines enunciated by the 
Federal Council for Science and Technology In 1961(1) for such payments 
seem to be basically sound, but will require seme minor modifications 
In the direction of widening the scope of possible policy decisions by 
journals. 



Besides this recommendation to the government, there are sev- 
eral suggestions chat can be made to publishers of Journals, particular- 
ly societies. One such Is for federation of a number of small publish- 
ing operations Into a single publishing organization that can publish a 
number of journals In such a way as to make possible employment of 
skilled personnel and maintenance of uniform work loads. Another Is for 
offering journals at a certain price per volume, with an unspecified 
number of volumes per year (private publishers often do this). Another 
Is for wider use of typewriter composition and photo-offset, or other 
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technologies based on composition within the production office. Finally, 
there is the desirability of systems of accounting that clearly separate 
pre-run, run-off, and other elements of cost. 
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COMMUNICATING CHEMICAL INFORMATION 



The American Chemical Society has long been a producer of 
primary journals and of abstracts and Indexes as well as an organizer of 
meetings. During the past several years a growing amount of effort has 
been devoted to organizing and disseminating chemical Information In 
ways that may build a base for optimum handling and use o * the rapidly 
growing volume of material. 

Study Is underway on the nature of Information flow to, 
through, and from meetings with the object of learning about the struc- 
ture of actual Information exchange In formally organized programs which 
are accompanied by a considerable amount of Informal exchange and which 
yield Irregular levels of formal transmission through publication. 

The journal system has been developed with publications of 
varying breadth and depth. A weekly news magazine, chemical & Engineer- 
ing News, designed to Inform accurately and lead to sources for details, 
cuts across the entire field of work Important to chemically trained 
people, selecting on a news basis the most significant Information. At 
the other end of the spectrum are research journals publishing only re- 
search papers and conmunl cations which have been critically reviewed 
(refereed) and edited. These journals perform alerting and awareness 
service as well as the archival function. In addition they form an Im- 
portant part of the base of social-professional structure of the scien- 
tific community, something which cannot be Ignored In the organization 
and dissemination of Information. 

Another set of publications Is the review journals, Including 
both comprehensive critical reviews of selected areas of sub-disciplines 
and concise reviews of closely limited fields that are of high current 
Interest. A publication of the latter type, Accounts of Chemical Re- 
search, contains reviews of four to six pages written by an active 
scientist of high authoritative standing In the field. The author Is 
asked to write almost entirely from his own current, live knowledge and 
give only the most Important references. The object Is to present a 
fresh and highly current view; the journal has been well received. 



Primary Publication Forms 



Studies are underway on the more effective design of journals 
and other forms of printed Information media. Possible designs Include: 

1. The Short Paper Journal --One form of this journal requires 
submission of both a manuscript In short form (not more than twice the 
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length of a Communication to the Editor) as well as a fully detailed 
form Including reproducible experimental information. The latter Is 
used in critical review or refereeing for acceptance and is filed in 
archives while the short form is published. The second form of short 
paper journals requires only submission of the short paper for publica- 
tion. Additional details must be secured from the author. 

2. Limited Scope Selective Journal—These journals would be 
developed by packaging selections of accepted papers according to group 
Interest profiles rather than according to sub-disciplines, the basis 
of most chemical journals today. Expected size of user groups would be 
between 500 and 2500. The subject matter might often be Interdisci pi in 
ary or Intersubdisciplinary with overlap among the many journals pro- 
duced. 



3. Selective Dissemination of Individual Papers--A continuous 
flow of papers related to the individual scientist's interests Is highly 
attractive in principle. If done by matching content of papers to indi- 
vidual profiles the content value should be high. The task of maintain- 
ing and servicing profiles for more than 100,000 scientists Is presently 
prohibitive, but may not remain so with developing technology. 

4. Title Lists with Individual Paper Avallablll ty--L1rcula- 
tlon of tables of contents would allow users to make their own selec- 
tions, but delay and processing appear cumbersome. 

5. Abstract Journals with Papers Available on 0rder--Th1s 
form offers better current awareness Information than Item 4, but sim- 
ilar disadvantages. 

6. Manuscript Prepr1nts--Manuscr1pt$ entered Into considera- 
tion for publication In journals are made Immediately available on order 
based on prompt publication of summaries. For proper functioning, this 
requires the authors' permission as well as treatment of material as 
private communications until edited and published. 



Computer- Assisted Photocomposition 



Four journals now are being produced by encoding manuscripts 
on tape for computer storage and handling. The tape Is used to operate 
a photocomposition machine (Photon) which produces the film base for 
making a negative for offset printing. This system is being studied re 
latlve to Its potential for use In a fully computer-based Information 
system In which a manuscript will require but a single Intellectual 
analysis and will be encoded only once for storage and use In producing 
the primary journal, abstract, Index, and other forms. Editorial and 
keyboarding operations are being combined Into a single flow to produce 
a machine-readable record with little variation from one publication to 
another. 
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Chemical Abstracts Service (CAS) 



The world's chemical literature has been abstracted by the 
American Chemical Society in its Chemical Abstracts since 1907. Since 
1962 the computer has been in use by CAS and work is underway to put the 
entire system on a computer base. 

The CAS system is an integrated man-machine organization that 
takes advantage of computer capabilities whenever they provide economic 
or technical benefits to help attain over-all ACS/CAS objectives. This 
system is designed to permit easy adaptation of new hardware and new 
data processing techniques as rapidly as CAS experimentation and com- 
munity developments provide the required technology. The following 
design principles are being employed: 

• The system and Its subsystems are designed to be indepen- 
dent of specific items of equipment wherever possible. 

• System-wide standards are fully defined and maintained. 

Each data element to be processed In the system is defined 
and given permanent, unique Identification. Program sub- 
routines which handle these data elements are carefully docu- 
mented and stored In the computer for routine use. Standard 
file formats Identify each Information record by code, define 
the location of the data, and state the length of each vari- 
able-length string. 

• The processing workflow is modularized. Individual modules 
are small and simple enough to permit ready adjustment or re- 
placement, and changes need be made only on the affected mod- 
ules. 

• The system Is founded on a unified, carefully Integrated 
data base which is built through a single analysis step for 
each document covered In the system. This data base provides 
information for all CAS publications and services and all 
searches of CAS computer files. Therefore, the data from each 
document are recorded In the maximum technical detail required 
in any CAS service or In any system based on a search of CAS 
files. The records reflect adopted standards for character 
sets, coding techniques, abbreviations, etc. for each output 
application, the data will be formatted to make the informa- 
tion content related to the specific output purpose easily 
identifiable by the recipient of the output. 

CAS processing activities will become increasingly automated 
throughout the shift to a computer base, but it is essential that regu- 
lar CAS services not be interrupted during this transition. To meet 
these objectives, a series of modifications tnust be made to the process- 
ing stream such that each change comes closer to the ultimate intended 
system while assuring continuity of the established regular services. 
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The CAS development program is based in seven areas of activ 



1. Computer-based, subject-oriented sorting routines to or- 
ganize output data into forms easy for humans to utilize. 
Routines are now being developed for sorting author names and 
associated document titles, and for ordering subject-oriented 
nonstructural concepts in combination with chemical nomencla- 
ture. The first operational system is scheduled for 1970. 

2. Direct computer control of every form of output display, 
Including composition of all CAS publications. The first 
version of a fully computer-based system capable of handling 
the complete range of CAS publications is scheduled for com- 
pletion by the end of 1971 with the processing of CA Issuee 
through the CAS computer composition system. The system is 
being developed so that relatively simple adjustments of com- 
puter processes will allow the use of new composition equip- 
ment as the state of the art develops. 

3. Use of computer support to simplify the keyboarding, 
proofing, and correction cycle and to eliminate multiple man- 
ual handlings of the same Information. 

4. Shift of an appreciable portion of the Intellectual edit- 
ing task from the professional staff to computer processing. 
Computer editing of direct structural Input Is well developed, 
and the development of computer editing techniques for nomen- 
clature has begun. Computer-based editing of nonblbllographlc, 
nonstructural Information lies In the future. 

5. Development of computer file organization and accessing 
techniques that will assure a responsive, reliable, and eco- 
nomical system with long-range viability. A great amount of 
skill Is needed In coping with the problem of building large 
computer-based Information files and of maintaining and using 
them Individually and In combination. The Interlinkage of 
CAS computer files will have to adjust as the files grow If 
the system is to remain responsive and economically viable. 

The manner In which the system Is used by the community will 
also deeply affect file organization developments. 

6. Close interlinkage of CAS activities with those of other 
primary publishers, secondary Information services, and re- 
source libraries. 

7. Development of effective means of supplying Information to 
Information centers for redistribution. The Interfaces In 
this area Include the development of data content and format 
standards for Information exchange and the determination of 
appropriate media for that exchange. Since much Information 
exchange is based on the documents in the system, a document 
registration system Is being considered. CAS Is also working 
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on algorithms for the unambiguous Identification of documents (Including 
both serials and individual articles within serials). 



Computer-Based Services 



CAS now offers the following computer-based services on mag- 
netic tape In a form that permits searching of the titles, abstracts, 
or Index terms contained In each Issue: 

t Chemical Titles --a bi-weekly alert to the titles of new 
papers appearing In 650 journals of pure and applied chemis- 
try and chemical engineering. 

• Chemical-Biological Activities --a computer-produced ab- 
stracting and Indexing service in biochemistry covering 
15,000 abstracts per year, dealing with organic compounds 
and their effects on biological systems. 

• Polymer Ccience & Technology- -a weekly, computer-based 
guide to the current journals, reports, and patent litera- 
ture of the chemistry, chemical engineering, and technology 
of polymers. 

• CA Condensates —weekly , computer-searchable tapes contain- 
ing complete abstract headings and CA Issue keyword Index 
terms for every journal article, report, and patent covered 
In the corresponding weekly Issue of CA. 

a Basic Journal i4fcati’av’te--b1-weekly , machine-language tapes 
containing the complete 04 abstracts for all articles appear- 
ing In 35 chemical journals. Headings and full texts of 
abstracts can be searched by computer to derive a bibliography 
of articles of Interest to toe searcher. Printed copies of 
abstracts accompany the tapes. 



Relations With Other Centers 



Output centers using CAS computer-generated materials are 
operating In five U. S. university centers and In Canada, Denmark, 
Sweden, and England. The Netherlands and Belgium are expected to be 
participating soon. 

International cooperation In chemical Information systems Is 
being developed through agreement between the ACS and ihe United King- 
dom Consortium for Chemical Information, of which The Chemical Society 
(London) Is acting as the contracting party, and between ACS and the 
Gesellschaft Oeutscher Chemlker of Germany. Each of the European or- 
ganizations will assume responsibility for marketing In thel* countries 
the publications and the computer-based Information services of CAS. 
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Both are working with CAS in preparation for abstracting the chemical 
literature of their countries for Input to the CAS system. Discussions 
with the Japanese have begun, to consider the possibilities of a third 
such international agreement. 



Other Dissemination Media 



All ACS journals now are on microfilm that is leased to users 
who pay license fees per original page; the fee permits the lessee to 
make hard copies to the extent that he wishes for use within his own 
organization. Microfiche also is being examined for various uses, In- 
cluding the recording of experimental details and data that may not be 
included in the pages of published journals. 

Audio and video tapes offer promising possibilities for Infor- 
mation handling. ACS Is developing for delivery in January 1970 an au- 
dionews system that will furnish 15-minute cassettes weekly to subscri- 
bers. While the first of these units will be devoted primarily to tecn- 
nical and economic material, an audionews unit for research and develop- 
ment is under consideration. 
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USER ORIENTATION OK PRIMARY JOURNALS IN PHYSICS 



The principal communication mechanism In the discipline of 
physics, the primary research journal, is being redesigned. Redesign Is 
made necessary by the growing problems of size, cost, and time delays 
that are outmodlng all conventional mechanisms such as primary, ab- 
stract, and titles journals. Redesign Is made possible by sympathetic, 
but pressing, national support and by new technologies. This report 
focuses on preliminary plans to supplement the primary journal system 
In physics with a user-oriented set of journals (or their equivalents) 
that will permit primary journals to continue to serve a unique but 
modernized role. These plans are not final at this time and are de- 
pendent on financial support from the National Science Foundation as 
well as on approval by the AIP Governing Board. 



Growing Problems with Primary Journals 



AIP's position as publisher of about 35 percent of the world's 
literature In physics leads us to anticipate that the total stack height 
of primary reseaich journals In physics for the world In 1969 will be 
about 60 feet. Let us examine the Implications of the 60-foot stack by 
comparing It with the Information problems of physicists In 1940. A 
volume of The Physical Review for the entire year of 1940 equalled In 
thickness that for one month of today's production. In 1940 physicists 
had little difficulty In monitoring the research completed or In pro- 
gress anywhere In the world In their subdiscipline. In 1940 most phys- 
icists could and did subscribe to The Physical Review or other physics 
journals In which their kind of physics was likely to be published. In 
1940, Individual recording and use of material from physics journals 
consisted of typing, photographing, or handwriting excerpts from the 
journal pages. 

Not so today. Today the proportion of physicists that sub- 
scribe to The Physical Review Is decreasing. The reasons are obvious. 
The first Is one of sheer bulk. Even though some physicists' offices 
are adequate to house 5-foot-wlde annual productions of the Review, the 
realization that a 15 percent growth rate Implies a doubling In 7 to 8 
years Is discouraging to Individual physicists. The second reason Is 
cost. Even though the Review and other physics journals published by 
AIP are Inexpensively produced and are available at low subscription 
prices per page {average nonmember price for AIP journals Is about 0.7 
cents per page), the total number of pages Is going up. Thus, when 
some single Issues of such journals as the Review or the Journal of 
Applied Physics reach 700 pages, the Issue must be sold to nonmeirbers 
for almost $5, a substantia) sum. 
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Another reason is relevance of the material in any one journal 
issue. If the material that is directly relevant to an individual phys- 
icist consists of 10 or 20 pages per issue, he is better off financially 
making photocopies even at 10$ per page, thereby producing his own cus- 
tomized separates journal. If, in addition, because of the spreading 
geographic as well as the growing Interdisciplinary nature of science, 
he realizes that the physics in which he Is Interested Is no longer 
limited to a single journal, but may appear in any one of 70 journals 
published by foreign as well as U. S. publishers, commercial as well as 
society, then he discards the convenience of his own personal office 
copy and relies increasingly on library copies plus photocopying. 

Let us speculate on the future of the conventional journal in 
physics if no changes are Instituted. Journals are likely to become 
library editions and reference works in the true tradition of an ar- 
chive. This bodes for an uncoupled, massive, antiquated conmunicatlon 
system that will collapse of its own weight, unless steps are taken to 
redesign the journal system to be a more efficient communications me- 
dium. Otherwise, in desperation, much less efficient and less respon- 
sive schemes such as preprints will be invented to satisfy the needs of 
small isolated components of the physics user community. The future Is 
forcing on us the development of a coupled, modernized Information sys- 
tem which, In ir\y opinion, must be built on the Important foundation of 
e revitalized journal system. 



Analysis of Problems 



The basis of science information problems Is obviously the 
well -publicized exponential growth of scientific and technical litera- 
ture. The present doubling time of 7 1/2 years for physics literatu'e 
means a particularly severe problem for the discipline of physics. In 
1968 alone, over 50,000 research articles In physics were published 
throughout the world In about 500 journals; over one-third of these 
apoeared In the 34 archival and translation journals published by the 
AIP. 



The seriousness of the problems for physicists can be appre- 
ciated from the implications of the nunfcers In Table 1. The numbers 
show that the members of the AIP publish In AIP journals about 4 times 
as many articles per year as the members of the American Chemical So- 
ciety publish In their journals, and similarly for members of the IEEE. 
These numbers are not conclusive, but they do suggest that physicists 
publish more In, end depend more on, primary journals than do chemists 
or electrical engineers. Therefore, when the journals start having pro- 
blems, physicists become particularly concerned and sensitive. 
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TABLE 1. 1968 MEMBERSHIP AND JOURNAL DATA 





AIP 


ACS 


IEEE 


Membership 


48,683 


126,269 


186,298 


Primary Journals 


21 


20 


36 


Editorial Pages 


65,887 


40,225 


23,759 


Approx. Pages 
per Member 


4/3 


1/3 


1/8 



The exponential growth is blamed by some on the growing pol- 
lution of the literature. This would not appear to be the case in 
physics, since the number of pages published by member physicists in 
AIP journals has been a reasonable constant for about a decade. There- 
fore, the growth in the literature can be attributed to the growing 
number of physicists who publish annually and not to their increasing 
productivity (and any concomitant decrease in quality). 

The expanding volume is flooding even the narrowest special- 
ties, making it difficult for the individual physicist to keep up with 
areas of immediate interest to him, let alone those bordering areas and 
new specialities with which he should be familiar. The situation is 
aggravated by the producer orientation of the present journal system in 
that the author largely decides when, where, and in what manner the in- 
formation is to be presented. The user is left to cope with the flood 
as best he can, often resulting in Information coming to his attention 
too late to be of use. 



User Orientation as One Solution 



One of the quickest and most expedient ways of solving the 
growing information problems of the physics community is to include in 
the design and operation of the primary journal system the needs of the 
users as well as of the producers. The orientation can be accomplished 
by producing a completely new. set of user physics journals resulting 
from a manipulation of the articles in the archival physics journals. 

The content of the user journals would be determined by standard in- 
terest profiles of the scientists and engineers who would use the arti- 
cles in the journals. 

The possibility and practicality of producing user journals in 
physics through the AIP results from the relatively small number of 
U. S. physicists (32,000), from their inclusiveness in AIP membership 
(about 75 percent), and from the concentration in publishing at AIP 
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(35 percent). This possibility can be pursued on a trial basis with AIP 
literature first. After demonstrating to ourselves and to commercial 
and foreign publishers that user journals will strengthen the journals 
of the individual participants and will make possible the continued con- 
tribution of journals as the fundamental science communication mechan- 
ism, we can then expand beyond AIP literature to Include the world's 
core literature In physics. 

Let us now become specific about user journals In physics and 
the future information system trends at AIP. In the past, we have owned 
and operated a set of journals for our membership; these journals satis- 
fied their needs as users as well as authors (producers). In the fu- 
ture, the operation of the archival journals for our members as authors 
will continue to be necessary. However, as users they are Increasingly 
Interested in obtaining rapid and economical research results regardless 
of what geographical location of the world or what scientific or engi- 
neering discipline was the production source of the information. There- 
fore, AIP plans to expand Its marketing of journals beyond the 21 jour- 
nals published by us. We plan to market at least 13 translation jour- 
nals, the eight journals of the Institute of Physics and the Physical 
Society (London), and a few other foreign journals. The marketing will 
Include eventually bilateral agreements and arrangements on the handling 
of hard-copy, microform, bibliographic material on computer tape, Index- 
ing, and integration into what has been called the National Information 
System for Physics (NISP). It is our hope that we can go beyond the 
total of 53 journals now marketed to include in NISP substantially most 
of the world's physics literature. 

One result of NISP will be the production of the user journals 
or their equivalent. For the purposes of illustration, let us assume 
that AIP will make available 300 user journals with journal contents 
determined by standard interest profiles modified annually. The user 
journals will select material from the archival journals slightly before 
the time of publication of the latter. Whereas the articles in the 
archival journals appear only once, a given article would appear in as 
many users' journals as would be required. Articles of broad interest 
would most likely appear in several user journals. 

The particular formats of the user-oriented services and com- 
binations thereof that will find user acceptance will have to be deter- 
mined by market surveys and pilot tests. Those now being studied in- 
clude: 1) Selective notification of information, a listing of ar- 

ticles in the journals during a given time period that match the stan- 
dard profiles of the 300 or so special interest groups in physics; 2) 
Packages of microfiche, selection of articles in microfiche form being 
sent to subscribers at periodic intervals; 3) Cartridge of microfilm, 
selection of articles copied onto a 1 6-nrri. strip and mailed in a car- 
tridge to subscribers; and 4) Hard copy selection, user journals pub- 
lished from offprints of the original journals. 

Emphasis in the production of the above sets of user journals 
or their equivalents would be on quality, timeliness, and economy with 
the objective of giving to physicists and other users a customized 
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service superior to their own (haphazard) collecting and photocopying. 
It Is our goal to make articles available simultaneously In the user 
and In the archival journals. It Is also our goal to Integrate the 
concept of the user journal into the over-all National Information 
System for Physics with Its long-range goals of a sophisticated (Inter- 
active) communication system that will benefit all scientific and tech- 
nical users of physics Information. 
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Case Western Reserve University 

Division of Special Interdisciplinary Studies 

Cleveland , Ohio 



THE TECHNOLOGY TRANSFER ANALOG 



As an historian of technology, I make one basic assumption In 
approaching the problem of improving the communication of scientific 
and technical information to Its users. This assumption Is that tech- 
nology Is basically an Information system, no matter whether we are deal- 
ing with machines, tools, processes, the organization of work, or any 
other of the n\yr1ad elements involved In technology. If technology can 
be viewed as an information system, then the process of transferring 
technology can be viewed as a special case of the dissemination of In- 
formation. In brief, the transfer of technology might serve as an ana- 
log for the problem of communicating scientific information. 

The transfer of technology, like most problems connected with 
the transfer of information, has many different aspects. First, of 
course, is the transfer of information from one generation to the next. 
Since this is the entire question of education viewed as a device for 
transmitting man's social inheritance and accumulated knowledge to pos- 
terity, it is a problem also affecting science, literature, laws, at- 
titudes, values--indeed, all elements of our culture. 

Second, in the context of the national economy, the problem of 
technology transfer applies to the transference of tools, techniques, 
machines, processes, etc. from an initiating firm to other firms in the 
same industry. Another category is the transfer of such items from one 
technology to another technology, as, for example, transfer of the tech- 
niques of air-frame manufacture to the construction of prefabricated 
housing. 

One can also look at the dissemination of technological infor- 
mation in an international context. One category is technology transfer 
from one industrialized country to another. Another involves the more 
difficult but more pressing problem of transfer from industrially devel- 
oped to underdeveloped countries. This is a transfer between cultures, 
one at a sophisticated level of technology to one at a more primitive 
level . 

Finally, there is an entirely different aspect to this problem 
which does not involve technology transfer per se. It is the informa- 
tion dissemination problem from basic science to technological applica- 
tion. This major problem has been catapulted into even greater signifi- 
cance, perhaps unwarrantedly so, by the common view that technology is 
simply applied science. If that is so, the chief problem which faces 
us is to acquaint technologists with the results of the latest scienti- 
fic research so that they can immediately proceed to apply them. Since, 



ERJC 



however, technology Is much more than, and sometimes quite different 
from, the direct application of pure scientific research, this problem 
Is more complicated than It might appear. 

Where do we stand In our knowledge of the dissemination of 
Information, whether considered as education, as the transfer of tech- 
nology, or as the application of science? The answer, unfortunately, 

Is that our knowledge lags far behind our Intentions. In the applica- 
tion of basic scientific research to technology, for example, historical 
scholars cannot agree even on the meaning of those few specific cases 
that have undergone thorough study, let alone arrive at generalizations 
having wide application. There has, for example, been much study of the 
role of science and technology In Janies Watt's Invention of the steam 
engine, some scholars claiming that Watt was led to his Invention by 
Joseph Black's researches on specific and latent heat, while others 
claim that Watt invented the separate condenser empirically and without 
any basic understanding of heat phenomena. 

Helping to mislead us on the question of science-technology 
relationships are charts showing Increasingly swift application of basic 
scientific discoveries. These charts indicate that It took 1600 years 
from Hero's aelopile to Watt's steam engine, 25 years from Faraday to 
the electric generator, and about five years from Bardeen and Shockley's 
research to the commercial use of transistors. Extrapolation suggests 
that by the year 2025, it will take milliseconds to go from enunciation 
of a basic physical theory to Its practical application. 

The trouble with such charts, of course, is that they rely on 
selective evidence. Judicious selection of other bits of evidence could 
perhaps prove that the time between basic research and application is 
lengthening rather than diminishing. Additionally, such charts some- 
times depend on very tenuous historical connections. Watt's steam 
engine did not come from Hero's aelopile but from Newcomen's atmospheric 
engine of 60 years earlier, and the Newcomen engine was an outgrowth of 
interest in pressures deriving from the preceding 17th century. On the 
other hand, work on the theory of the transistor can be traced back 
further than Bardeen and Shockley, to the 1 9 20 ' s and earlier in this 
century. 



What we do know historically about science-technology rela- 
tions would seem to indicate that, for the most part, new technology 
comes chiefly from old technology without much relationship to the 
latest s ijntific discoveries. If technology today seems more reliant 
upon science than in previous times, perhaps it is because technology 
has itself become more of a science than an empirical, craft art. The 
clues to the relationships between science and technology are thus not 
to be found in any simple transference of scientific data to the tech- 
nological community but must be sought in more circuitous routes. This 
might lead us to redefine the problem. Perhaps it is not a question of 
getting the latest pure science into the hands of technologists but 
getting the latest science-based technology diffused to technologists. 
Solutions to this problem might lie in changes in engineering education, 
in the organization of industrial research teams, in the patent system, 
in professional societies, and in engineering publications. 
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In developing an analog of technology transfer, one finds few 
historical studies leading to meaningful generalizations. These 
studies have only recently gained prominence, partly because of attempts 
of NASA and the Department of Defense to justify their research expendi- 
tures and of attempts, to advance the technological level of Industrially 
backward nations. One central fact of historical studies has emerged, 
namely, the prime importance of the person-to-person quotient, or, more 
simply, the role of people In the process of Information dissemination. 

History shows, for example, that American technology during 
the colonial period relied almost entirely on British technology brought 
by immigrant craftsmen from their homeland. This continued during In- 
dustrialization In the first half of the 19th Century when the U. S. 
simply Imported British machines and tools and the workmen who knew how 
to operate them. Alternatively, Americans went abroad to learn about 
the machines and returned with this knowledge and the machines. Today, 
we are exporters of technology but we find it necessary to export, to- 
gether with machines and processes, the operating know-how that must be 
conveyed by people. 

An eminently successful Illustration of the effect of person- 
to-person transfer has occurred in agriculture. Productivity of Ameri- 
can agriculture has soared since the beginning of the 20th Century at a 
rate outstripping that of improvement in industrial production, although 
more attention has been given to Industrial phenomena such as the assem- 
bly line and automation. How was the agricultural revolution accom- 
plished? Textbooks attribute the sharp rise in productivity to the de- 
velopment of scientific agronony, the use of machinery, and the appli- 
cation of capitalistic methods to farming. True, but how were scienti- 
fic methods made known to the individual farmer stuck out in the coun- 
try? How did he learn about new strains of hybrid corn and wheat, the 
fertilizers, and pesticides which could multiply his yield? Where did 
he find out about the machinery and how to operate it? And how did the 
farmer learn to apply industrial techniques? 

First we must ask where the knowledge originated. The ori- 
ginal research had two sources: 1) educational institutions, mainly 

state agricultural colleges which are land-grant institutions; and 2) 
laboratories, mainly those of the U. S. Department of Agriculture and of 
the state agricultural experiment stations. The colleges transmitted 
some of the information to the newer generation of fanners through edu- 
cational processes. However, the bulk of continuous dissemination was 
carried by state agricultural extension services and county farm agents 
who brought new knowledge directly to the farmer on the farm. There 
were, of course, other means of research dissemination--farm journals, 
county fairs, and demonstration projects, some of the latter effected 
by commercial seed producers and manufacturers of farm equipment. These 
transfer agents also relied largely on personal transfer of knowledge. 

The effectiveness of such means of disseminating research re- 
sults to the user is also demonstrated by their success abroad. Dra- 
matic increases in rice and wheat production in food-hungry countries 
resulted from the application of farm extension services to these 
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nations by which people were taught how to use fertilizers, pesticides, 
and farm machinery and how to plant, cultivate, harvest, and store. 

Is, however, the example of transfer of agricultural technol- 
ogy pertinent to questions of the transfer of scientific research data 
within the scientific community? To a certain extent, scientists are 
used to obtaining Information through reading, and demonstration and 
personal communication are not essential. One might note, however, that 
the community has sought Interpersonal communication since the time of 
Plato's Acadeny and the Museum of Alexandria. At the time of the Scien- 
tific Revolution, personal communication became Institutionalized with 
the formation of the Royal Society and the Academle des Sciences. Today 
the flood of publications makes it almost Impossible for the scientist 
to read everything In his own field, and he has resorted to invisible 
colleges and conferences to help him, In a sense, separate the wheat 
from the chaff and to keep himself up to date. Additionally, growth of 
postdoctoral research projects, exchange professorships, and Interna- 
tional scholarships for training scientists of backward countries In 
the schools and laboratories of developed nations testify to the need 
for personal Interaction. A tendency to Idealize and depersonalize 
science and technology overlooks the fact that these are human endeavors 
in which personal exchange can be more efficient than impersonal mechan- 
isms. 



Nevertheless, even institutionalized modes of personal contact 
cannot cope with all of the information dissemination requirements. 

Some way must be found to store data, to retrieve the useful data, and 
to bring that scientific knowledge to the individual who can apply it 
fruitfully. If we accept the idea that the transfer of knowledge re- 
quires people as well as publication, we appear to need a new breed of 
middleman, an information-transfer specialist, whose particular task 
would be to bring the results of research to its possible users. One 
model of this specialist is the entrepreneur in technological enter- 
prises. He brings together the basic technical idea, the business ex- 
pertise, the economic demand, and the capital necessary to translate an 
ingenious idea to a profitable innovation. Many an invention would 
have been stillborn but for the entrepreneur who saw its possibilities 
and successfully brought together the many elements required to convert 
it to practical application. Watt's steam engine, for example, prob- 
ably needed the entrepreneurial work of Matthew Boulton. Similarly, 
Edison would probably not be remembered as America's outstanding inven- 
tor had he not combined entrepreneurial qualities with his inventive 
genius. 



The research manager in large-scale industrial research lab- 
oratories is sometimes like the commercial entrepreneur. He brings to- 
gether a team possessing scientific knowledge and technical expertise, 
adding his own knowledge of the economic constraints and the social and 
economic needs. In a sense the successful research manager works as an 
agent to disseminate knowledge from many different fields to produce 
profitable applications. 
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It seen* worth Investigating the characteristics that could 
be Institutionally developed In people for the role of Information- 
transfer specialist or information entrepreneur. However, we must 
avoid the mistake often made by professional educators who become so 
Infatuated with techniques (for example, the techniques of Information 
dissemination) that they lose sight of the substance to be taught. 

Good farm agents very likely know a great deal more about farming than 
they do about the techniques of Information transfer. Perhaps we 
should take our clue for solving the problem of communicating scientific 
and technical Information from those who have already done It success- 
fully. 
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COMMERCIAL INFORMATION PRODUCTS AND SERVICES 



CCM Information Corp. Is a wholly owned subsidiary of Crowell 
Collier and Macmillan, now the largest book and general education pub- 
lisher In the country. Our subsidiary Is a publisher of books, jour- 
nals, and Information services n a variety of subject fields. Created 
primarily to fill Information needs that can best be served through non- 
tradltional publishing methods, our services stress effective use of 
computer processing and microfilm techniques. 

Much of our product Is published In several media, depending 
on customer requirements. Not infrequently vie publish a given data base 
in print, microfiche, microfilm, and computer-tape form, and will also 
offer special search services based on access to our computer. Our bib- 
liographic computer files are all based on the Library of Congress MARC 
II format to facilitate use by libraries with their own computer-based 
systems. 



Our long-range objectives include not only the publishing of 
catalogs and definitive indexes to important information files but also, 
whenever possible, the publication or distribution of the substance of 
what has been indexed. 



Products and Services 



The Pandex Current Index to Scientific and Technical Litera- 
ture is a data base that stores, annually, title and author information 
on approximately 250,000 journal articles, 35,000 VSGRDR technical re- 
ports, and 6000 new books. It is primarily a current-awareness service 
through which information is provided on published literature, frequent- 
ly within 12 days of publication of the original source document. All 
works are indexed under an average of six computer-generated and the- 
saurus-controlled subject headings, as well as by author. The data base 
is available to customers weekly in magnetic tape form, biweekly in 
printed format (about 400 pages per issue), and quarterly and annually 
in microfiche. 

Three separate publications have been developed from our new 
computer-based information file, Current Index to Conference Papers. 

Here the emphasis is on speed of processing to provide up-to-date infor- 
mation quickly, frequently before some of the cited papers have even 
been delivered. 
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The quarterly publication, World Meetings , provides Informa- 
tion two years In advance on Important scientific and technical meet- 
ings to be held throughout the world. 

Calls for Papers Is a weekly alerting service that lists all 
"Calls" for scientific and technical meetings In a standardized and re- 
producible format. 

Researoh and Miorofilm Publications (RMP) provides monthly In- 
dexes, with corresponding microfilm or microfiche, to all the transla- 
tions (about 250,000 pages per year) of the U. S. Joint Publications Re- 
search Service. Even though all documents are also available from the 
Clearinghouse for Federal Scientific and Technical Information, our In- 
dexed and packaged service sells widely to research libraries and gov- 
ernment agencies In both the U. S. and abroad. Until recently, the RMP 
service was organized and Indexed solely on a geographic basis. This 
fall, we started a new service that Indexes separately all scientific 
and technical material, regardless of country of origin. 

Current Index to Journals in Education, Inaugurated In 1969, 

Is the result of an Innovative partnership between the ERIC network of 
the U. S. Office of Education and ourselves as representative of the 
private sector. The program Indexes the newly published educational 
serial literature and includes a monthly journal, semi-annual and annual 
hardbound cumulations, full computer-tape publication, and plans for a 
broad spectrum of specialized subsidiary publications. The publication 
contract was let under standard competitive-bidding procedures and em- 
bodies the concept of a phased transfer over a period of years, from 
partial government subsidy to complete support by the publisher. 

Through partnership wi th a London-based firm, Information Re- 
trieval Ltd., we offer Biomdioal Abstracts, in seven subject areas: 
Microbiology, Virology, Genetics, Aquatic Biology, Calcified Tissues, 
and Entomology. Publication is monthly with an annual index. 

Our National Auxiliary Publication Service Is a unique micro- 
fiche publication program operated in conjunction with the American So- 
ciety for Information Science. Its purpose is to make available in mi- 
crofiche Important material supplementary to basic articles published in 
the professional journal literature. Supplementary material Is received 
from journal editors or authors, microfilmed, indexed, and assigned an 
accession number. This information is printed in the journal with the 
summary article, and interested parties can obtain the supplementary 
data for the modest fee of $1 per microfiche. 



New Approaches 



We feel that the commercial Information industry has embarked 
on several programs that either offer or promise progress in scientific 
publication. In some cases, this progress Is conceptual; In others, it 
is more simply the belated application of private sector Initiative to 
problems that are as old as awareness of the information explosion itself. 



A notable example Is the Current Index to Conference Papers 
data base. The fact that It Is published not only In print but In 
weekly computer- tape form In a standard format Is something of an In- 
novation, and hopefully represents publication progress. Much more 
significant to the future of scientific and technical publications, 
though, Is the basic fact that for the first time, Information on the 
con fere nee- paper literature from over 2000 worldwide scl/tech meetings 
per year Is being made widely available. Each year we expect to Index 
about 150,000 papers. Based on current statistics, about 80 percent of 
these will eventually be published In the professional literature, but 
publication frequently won't occur for a year or two or even longer. 

Recent research at the Center for Scientific Communication at 
Johns Hopkins has established that, for active research scientists and 
engineers, the papers delivered at conferences are a more important 
source of Information than the published literature. Current Index to 
Conference Papers may Introduce a demand that could provide the Impetus 
for dramatic changes In scientific and technical publication practices. 

Instead of the 8 or 10 papers now provided monthly by the typ- 
ical professional journal, why not have one that would offer abstracts 
of 40 or 50, and Include the full text of each on microfiche in a pocket 
on the Inside of the back cover of the publication? Outwardly, the 
publication would appear much like those produced now; in print would be 
news articles, letters to the editor, the 40 or 50 abstracts, and of 
course sufficient pages of advertising, yet information on Important re- 
search in the field would be reaching subscribers at a four- or five- 
fold accelerated rate. 

Early next year, we hope to initiate a program that will offer 
conference papers on microfiche through arrangements with sponsoring so- 
cieties or authors. Audio tapes of key papers may also be offered. 
Having created the index, we now view our next job to be providing the 
substance itself. 

Another innovation in our program to provide secondary infor- 
mation about research in science and technology Involves our plans to 
offer specialized subject- field current- awareness services. The diagram 
on the following page represents a time-scale continuum of public infor- 
mation about scientific and technical research. For information to be 
most useful, it must be organized to the needs of the individual. Thus 
our planned Current Awareness Publication Services (CAPS) are designed 
to provide the research specialist with all the secondary information 
he needs about who may be, is doing, or has published what. We feel 
these CAPS services may be a more practical way of filling sci/tech 
current awareness needs than individual SDI services that are expensive 
now and offer little promise of a marked drop In price even with in- 
creased numbers of users. 

Other plans include enlarging our coverage of the foreign lit- 
erature and making primary data files available and usable in computer- 
tape form. We believe tremendous opportunities for progress exist 
through constructive associations between government and the private 
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sector. New information systems frequently need massive investment to 
get them underway, not uncommonly an investment greater than can be un- 
dertaken by a single commercial organization. But the Information sys- 
tem, after it has been created, can profit from the publisher’s art-- 
packaging, advertising, selling, and customer education. I can only 
hope that pregrams similar to the one we have underway with the Office 
of Education will be developed by other government agencies. 
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SOME CONSIDERATIONS CONCERNING IMPEDANCES IN THE 
SCIENTIFIC COMMUNICATION NETWORK 



We assume that the forcing function of the national research 
and development effort stems from the demands of society for satisfac- 
tion of Its felt needs towards achieving better health, security and 
material well-being, and that these desires will continue unabated In 
the foreseeable future. 

In the final analysis, the nation's effort Is’ kept from the 
greatly Increased productivity that society hopes for, for a number of 
reasons. Basically, the R and D process Is a creative one, admittedly 
not too well understood. It Is hard to accelerate Its accomplishments 
by simple additions of manpower and other resources. The secrets of 
nature and the elements of discovery and Inventiveness apparently can- 
not be forced. 

In addition, tho scientific communication system, which serves 
to provide the Important multiplier effect In R and D by permitting 
workers to profit from the knowledge and experl ent. of co-workers, con- 
tains Impedances In Its networks, some of which may have hard cores of 
Inertia which will be extremely difficult to remove,!' and others whose 
amelioration Is possible but which will require much greater concern and 
effort on the part of all Involved. 

I will simply outline the latter elements and then discuss 
them In detail sequentially. In particular, the role of the federal 
Government, as a principal agent of responsibility for society In this 
endeavor, will be discussed. 



Topics of Relevance to the Impedance Issue 



a. The business aspects of R and D. 
1. proprietary Issues 



XJ The engineer will recognize that the removal of all Impedances In 
these positive feedback loops would lead to a divergence or true explo- 
sion In the overall effort. One can be consoled against this happening 
by recognizing that as long as people are Involved In the effort, there 
will always be some Inertias. 



2. competitive nature of industrial activity 

3. commercial Issues In Information handling 

4. property aspects of information 

5. Information as a resource in the R and 0 
process 

b. Secrecy, national security, and International rivalry. 

1. need-to-know I? 

2. proper balance 

3. one-world of science vs nationalism 

c. Language problems. 

d. Mores of the scientific community. 

e. Time delays In various aspects of the communication 
problem. 

f. Human factors, e.g., Informal practices, limitations 
of the Individual - In assimilation, etc. 



Business Aspects of R and D 



No one can really understand the scientific communication pro- 
cess unless he first appreciates that In recent times we and other na- 
tions have adopted a very pragmatic view towards our scientific and 
technical establishments. Thus science and Its twin, technology, have 
become a big and vital bueineee. 

1. It Is clearly recognized that scientific Information 
plays a significant role In this business process and for that 
reason It Is treated as proprietary. The fact, however, that 
so much of the R and D process Is sponsored or supported by 
public funds demands that Its results be placed In the public 
domain as soon as possible. 

This area Is close to that of Patent Policy and must 
be handled in concert with It (cf. the recent FCST recommenda- 
tions on this subject)*, however, It should be pointed out that 
the fraction of Important new scientific information that is 
being treated as proprietary Is Increasing. Its expeditious 
rel ease must be insisted upon by government . 



2. In spite of the fact that U. S. Industry Is consider- 
ed to be "all on the same team" with regard to major national 



undertakings like the Defense program, the Space program, 
H.E.W. , etc. --we operate through a competitive industrial 
base. An Internal data base of scientific Information and 
know how is often considered a principal asset of one group 
In competing with its competitors. To that end, crucial In- 
formation Is often withheld or written around in scientific 
papers or technical reports. This situation can only be 
changed by conscientious project officers who seek to Improve 
the quality and completeness of the reports issued on their 
programs. 



3. While It has become recognized that the communica- 
tion efforts should be considered as elements In a eyeiem , 

It Is well known that various portions of the effort are 
managed by different groups with different motivations and 
practices. A most Important portion. Including valuable pri- 
mary and secondary materials has come under the control of 
commercial Interests. While recognizing the free enterprise 
assets as Important It must still be the concern of the 
scientific community and the government to oversee the per- 
formance of this portion of the system. Both the elements of 
journals and books are too Important In the overall system to 
be allowed to deteriorate In any way without Interfering with 
the Integrity of the system. 

Likewise, concern for the viability of other aspects 
of the private sector performance must be observed. 

4. Information Is Intellectual property. Its proper 
protection (copyright, etc.) serves as an incentive to further 
efforts In the field. At the same time advances In repro- 
graphy and machine handling of Information permit new Improved 
ways of exploitation and personal access to Information. A 
fair solution must be found In the Immediate future to the 
proper economic and Incentive support and use of Intellectual 
property. 



5. Information Is a resource to the R and D process. 

The proper financial support to this resource and Its relative 
priority In consideration with other resources must be further 
addressed. Furthermore, the measures of effectiveness for 
alternative elements In the communication process deserves In- 
creased attention. 



Secrecy 



Compartmentellzatlon and need-to-know restrictions are Insti- 
tuted to purposely restrict Information flow to avoid compromise of In- 
formation to the enemy. Naturally they serve to keep Information from 
other programs In our own efforts which might profit from this know- 
ledge, Including other governmental programs and other national goals. 



There exist no opportunities to judge the relative merits of wider dis- 
semination other than within Individual Agencies of government. Some 
eminent persons have called for a "wider forum" In which to consider 
these issues. I consider this an extremely important issue which so far 
has apparently escaped the concern of the Chief Executive's staff. 

In addition the integrity of science and effective technology 
as8e88ment are severely handicapped by these restrictions. The recent 
report of M. Mead's AAAS Panel and Its predecessor group are worth not- 
ing in this regard. 



JACK SIEBURG 



Office of the Judge Advocate General, VSAF 
LITE Activity 
Denver, Colo. 



INFORMATION POTENTIALS OF FULL-TEXT 
COMPUTERIZED RECORDS 



Many approaches are being studied today to determine ho* best 
to take advantage of new Information storage and manipulation devices. 

At LITE (Legal Information Thru Electronics), we convert the full text 
of documents to machine-sensible media, that is, we store every word of 
the original body of text material. The current LITE master file con- 
sists of approximately 75 million words of text and includes such major 
texts as the United States Code, the published and unpublished Decisions 
of the Comptroller General, and the Aimed Services Procurement Regula- 
tions. Our design philosophy is that, with full text, we have virtually 
unlimited possibilities for generating information product subsets and 
management aids. 

The LITE system presently operates under contract on an RCA 
Spectra 70/45. Peripheral equipment includes a high-speed printer, card 
reader and punch, 800-bpi magnetic tape drives, and disk and magnetic 
card immediate access storage media. All text and word locator files 
are stored on magnetic tape for processing, under a tape-oriented search 
and retrieval system, and for emergency backup purposes. Files con- 
sidered to have major research value or that are subjected to a high 
rate o* activity are also stored on immediate access devices and pro- 
cessed under a random-access system. 

For search and retrieval operations, LITE uses an inverted 
file of word locators which identify the exact location of every non- 
trivial word in a particular body of text. Word locators are not de- 
veloped for common or subjective words such as the, and, but, but their 
position in the text is accounted for in developing the locators for the 
nonconmon words. The word locator information is contained in a compu- 
ter-generated 12-byte fixed-field code. 

LITE programs incorporate several techniques for restricting 
the anount of output. Among these are limitations that 1) require that 
specified word relationships occur only in (or do not occur in) certain 
types of text, 2) documents identified for retrieval are current and in 
effect, or 3) were effective during specified time spans. The first 
capability results from the use of type codes and include/exclude type 
instructions. Control data for the last two techniques are contained 
in a machine-generated control record that precedes each text document. 

Documents identified for retrieval are available in one, or a 
combination, of three types of printed output: context, citation, or 

full text. The first, context, produces three lines of text showing how 



the search words were used in context, the words requested appearing in 

the middle line. The citation output tells where, in what specified 

document, the words requested may be found; for example, the page within 
a volume of Deoieions, The last output produces a print of the entire 
text; normally, due to print- time factors, this method is not used if 
the person requesting the search has access to the printed material. 

LITE file maintenance operations cover two distinct types of 
data banks, those that are updated by adding new material to the end of 

the present file, and those that must be updated by the insertion of 

revised or new material. For the former type files, new material is 
assigned document numbers in ascending sequence and the master file 
grows until it reaches a workable limit of approximately 7.5 million 
words; at this point, a new file is started. For files that require 
revisions, i.e., the U. s . Code, three techniques may be employed. The 
file maintenance techniques for updating the master text files also 
simplify updating of the master word locator files. Word locators are 
developed for the new material and are then merged with those developed 
for the old master file. 

Specialized indexes prepared by the LITE system have proved 
extremely valuable. The term eeleotive in Selective Key Word In Context 
Indexes may refer, in the LITE system, to keywords (words for which in- 
dex material will be developed) or stop words (words to be excluded from 
the final index). The final index print routine can be directed to de- 
lete additional stop words and/or phrases. The LITE system also has the 
capability of providing index material on magnetic tape, properly for- 
matted for photocomposition purposes. Formatting capabilities include 
the use of light and bold-face type, indentation, use of the leading 
factor and, in some Instances, the Inclusion of upper/lower case. 

Another type of specialized Index is the cross-reference or 
citation index. In the development of this type of Index, one body of 
text is electronically processed to locate all references (citations) to 
another specific body of text (e.g., citations to the Court of Claims 
cases appearlnq in the Deoieions of the Comptroller General), or to an- 
other section of the same body of text (e.g., references in Title 10 of 
the 'J . S. Code to Title 5 of the Code). The printed Index may be se- 
quenced in either citing or cited reference order. 

Another special index developed by LITE Identifies all Deoi- 
eione of the Comptroller General that have been overruled or otherwise 
modified. The system, consisting of three separate phases or programs, 
takes advantage of the man-machine interface. Computer analysis first 
locates all decisions that have been referenced by subsequent decisions, 
the original computer output consisting of data Identifying the modified 
and modifying decisions, In both printed and card format, plus a con- 
textual print of the modification action. Then, the type of modifica- 
tion action (overruled, modified, distinguished, etc.) Is entered by 
people, appropriate codes are gang-punched Into the data cards, and the 
reference card deck is processed to magnetic tape and sorted In ascend- 
ing sequence on the modified decision reference factor. The final Index 
is then formatted and printed out. 
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One of the special products available from LITE is text edit- 
ing. The original text editing routine was designed specifically to 
assist management in the complete revision of an existing regulation. 
Prior to preparing a draft of the revision, the LITE staff designed and 
developed a computer application that 1) locates and identifies (under- 
scores) all suspect words and 2) lists the proposed substitute word or 
words in the right-hand margin. (A suspect is a word that may be super- 
fluous, ambiguous, or for which an authorized abbreviation might be sub- 
stituted). Use of this editing technique in the revision of the regula- 
tion resulted In a size reduction of approximate 1 )' 40 percent, improved 
clarity, and a higher degree of standardization. 

Future plans for the LITE system include a redesign 1) to im- 
prove and expand search and retrieval operations and 2) to incorporate 
remote access capabilities and reformatting of files. The use of micro- 
form for storage of data bases and index data is being studied. Present 
plans envisage copies of the files being available at regional sites 
with the required scanning and output media; index data could thus be 
available at a much reduced cost with a minimum of storage requirements. 
Transmission of information over telephone lines is also being studied. 
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EXPLOITATION OF FILES FOR INFORMATION PRODUCTS 



The purpose of this paper Is to consider some of the products 
and services that can be derived from machine-readable and other files 
such as are generated by discipline- oriented abstracting and indexing 
services (hereafter abbreviated A/I services). The products and ser- 
vices that can be produced from presently available files will be con- 
sidered first. Next, the use of personnel directories and other files 
that can be merged with A/I files to produce new services will be dis- 
cussed. Finally, some suggestions will be made for the Inclusion of 
additional data elements In A/I files that promise to Increase their 
utility as a source of new services. The time scale Is the near future, 
and most of the products are printed publications or computer printouts. 

A study by Herner & Co. of the Psychological Abstracts file 
considered economic factors and probable markets In the development of 
one representative list of such products. The following were Identified 
as available from machine- readable abstracts and Indexes: 1) cumulative 

Indexes, 2) computer tape copies of the A/1 file, 3) special Interest 
subsets of the A/I publication, 4) continuing bibliographies for mission 
or multldlsclpllne oriented users, 5) topical bibliographies, 6) Indexes 
for primary publications, and 7) directories of research activities. Of 
the foregoing, only the last two can be considered Innovative or unus- 
ual. Since these products are conventional, however, one can extrapo- 
late to their use with some confidence. 

Based on an hypothesis that local directories of research have 
value to researchers In a given geographic area, a list of entries In 
Psychological Abstracts that emanated from Washington, D. C., Institu- 
tions was complied. Questionnaires were sent to local authors and re- 
sponses were used to supplement the computer-generated Information. A 
directory was printed and distributed for evaluation purposes. Re- 
sponses Indicated that local directories might well serve some Informa- 
tion needs. Our experience suggests that useful directories can be 
generated directly from A/1 files. It Is Important to note, however, 
that a small percentage of the professionals In a field produce most of 
the scientific literature. Thus, while there are approximately 30,000 
psychologists 1*» the U. S., less than 10 percent produce between 70 and 
80 percent of the literature. Similarly, a limited number of Institu- 
tions are sources of most of the published research, and a limited 
(approximately 20 percent) nunber of the journals provide most of the 
relevant materials. Furthermore, there Is a substantial correlation be- 
tween the most productive authors and the most productive Institutions 
that, In turn, tend to publish In the core Journals of the discipline. 



The core file for psychology will thus contain Information on about 
2000 to 2500 authors, 50 Institutions, and 200 journals. This can no 
doubt be generalized to other fields. 

This suggests that a comparative analysis of an author file 
with the classification schedule In a field would provide basic Infor- 
mation on the major areas of a discipline In terms of Its current liter- 
ature. For example, there appear to be approximately 120 principal 
areas in psychology. An Innovative use of the directory file was In the 
planning and programming of scientific meetings and conventions. 

Surveys of the users of A/ I services have frequently Indicated 
that users have difficulty In obtaining noncore, or peripheral, litera- 
ture, particularly foreign technical reports and monographs. Since A/ I 
services maintain current addresses for sources of technical reports and 
monographs, and less well known journals, this Information could be In- 
cluded In A/1 publications. Discipline-oriented A/1 services could also 
collect dlfflcult-to-obtaln documents and, for those not limited by 
copyright, give them to a repository, noting the repository accession 
number In the printed A/I journal. 

Personnel directories, subscription and membership files, also 
appear fruitful sources of Information. It appears that they can serve 
three main purposes: (a) They hopefully provide accurate Information 

on addresses, of Immediate concern In publishing local directories, etc. 
(b) They provide a basis for Improving current author Indexes. Most 
author Indexes are name rather than author Indexes, since publications 
only list authors' Initials. From affiliation and other supplementary 
data, It would be possible to more accurately Identify an author, (c) 
They often contain Information on Individuals' subject matter Interests. 
The potential utility of such Information for SDI purposes Is obvious. 

Finally, I would like to suggest a procedure for adding a 
major Information Item to A/I files, namely, citation Information. At 
present, the addition of such Information Is costly due to (a) lack <jf 
standardization In bibliographic format with the result that a variety 
of formats must be normalized, and (b) the character space required for 
references. The suggestion Is that A/I publications offer a service 
to editors of selected primary Journals to check references given In 
manuscripts against their cumulative files. For references for which 
abstracts have been published, the A/1 service would furnish the ab- 
stract accession nunier that would then be printed as part of the cita- 
tion In the primary publication. The editor of the primary journal has 
thereby been assisted through a check of the cited references. (This Is 
not unimportant, since a substantial number of references appear to be 
Inaccurate.) The cost to the primary journal Is the Inclusion of ab- 
stract accession nunfcers In the printed references. The reader of the 
primary journal will know lirvnedlately how to locate an abstract of the 
reference. One clerk Is likely to be able to provide reference checks 
for 10 to 15 journals. An estimate for Items In Psychological Abstracts 
Indicates that this procedure would permit coverage of 80 to 85 percent 
of the journal articles cited, 50 percent of the bocks, and 25 percent 
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of the technical reports. Gaps In prior coverage could also be Identi- 
fied through this procedure. 

It Is somewhat difficult to anticipate all of the potential 
uses of citation Information. It is evident, however, that citation 
patterns provide an ability to discriminate the active from the Inactive 
literature. 
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IMPROVING USER ACCESS TO INFORMATION THROUGH 
APPLICATIONS OF TECHNOLOGY!/ 



. * j ,y. IT s ProJect ^trex is an experimental, pilot-model, machine- 

Ibrary system. The system includes a computer-stored catalog 
of 0,000 journal articles in selected fields of materials science and 
engineering, and the full text of the 10,000 articles on microfiche. 

The catalog is contained in a general-purpose, time-shareu computer and 
is accessed through specially designed alphanumeric consoles, one of 
which has been implemented to date. The consoles are connected to the 
central computer through a buffer/controller. The full-text microfiche 
collection Is accessed through the alphanumeric console and the access 
facility is designed to provide guaranteed, rapid access to any document 
in the collection at locations that are remote to the store. Full text 
may be viewed, page by page, at the user's station by means of a sep- 
arate storage-tube display, or a permanent copy may be obtained either 
on 35-mm. film or as an 8 1/2 x 11-in. print. The elapsed time from the 
ordering of a 35-mm. film copy to the availability of it at the film 
station is approximately 90 sec. The first page of an article appears 
on the storage-tube display within 7 sec. after an order is placed, and 
each succeeding page can be obtained within 3 sec. 

Intrex will use this experimental library as a means of gain- 
ing insights into the design characteristics of large-scale operational 
systems of a similar kind. We are In the process of evaluating the 
merits of the system by making It available to a selected community of 
users who have a bona fide need for the Information contained In the 
system, and to librarians who may wish to use It for reference purposes. 
It is our intention to alter the characteristics of the system as we 
learn about its strengths and weaknesses from our user community. 

It should be noted that the system brings the library to the 
user; it circumvents his need to go to the library for the Information 
he is seeking. To be acceptable as a working system, however. It must 
engender satisfaction from the viewpoints of conpleteness and relevance 
of the information retrieved; It must be easy to engage; and It must 
provide quick, reliable service at costs that are realistic. 

Intrex is examining several Issues with respect to In-depth 
cataloging and the extent to which in-depth cataloging is needed when 



17 This summary Is a condensation of a paper by Professor 
J. F. Reintjes, "System Characteristics of Intrex," 1969. 



guaranteed, rapid access to full text Is, and Is not, readily available 
at the user's station. As many as 52 different Items of Information are 
being entered for each journal article cataloged. Through monitoring of 
the frequency of requests for each Item we shall be able to draw conclu- 
sions on the relative value of each Item. 

The possibility exists that easy access to full text may alter 
the behavior patterns of those using the catalog. Since some of our 
catalog information is frequently contained on the first page of the 
document itself (author, author's affiliation, journal name, volume num- 
ber, page, abstract, and so forth), users have the option of obtaining 
this information either by retrieving It from the catalog or from the 
document Itself. Our purpose is to Investigate the factors which govern 
user behavior when seeking information of this kind. 

For storage of full text we have chosen image storage on mi- 
crofiche. Computer storage of full text was discarded because of the 
huge amount of storage required. The use of film ensures preservation 
of pictorial information, and microfiche Is well -suited for journal-ar- 
ticle-type literature. We are using COSATI standard microfiche with a 
reduction ratio of approximately 18 to 1 and sixty frames per micro- 
fiche. Since our documents are derived from the published literature, 
we store text in image form rather than digitally because of the ease 
with which page text can be converted to a microfilm image. 

The full-text storage and scanning unit, called the central 
station, is time-shared by several receiving stations. In order to min- 
imize the time each microfiche is out-of-storage, each frame on a micro- 
fiche Is scanned only once by means of a flying-spot scanner. Video 
signals are transmitted as analog information and to each frame of 
transmission is added a receiving station address in digitally encoded 
form. Single-frame transmission requires storage at the receiving sta- 
tions; this storage is photographic in the case of the film station and 
electronic in the case of the storage-tube receiving station. 

Our investigations show that at least 2,000 scan lines are re- 
quired to reproduce the stored images with acceptable legibility. Var- 
iations in type sizes and quality of printing among documents dictate 
this scan-line requirement; in fact, as many as 3,000 lines may be 
needed where the quality of print is marginal and small type size is 
present, as in the superscripts and subscripts of mathematical et? 1 
tions. 



The alphanumeric- console system through which the cata 
accessed has been developed as an experimental vehicle to detem 
special attributes which should be included for user convenience 
brary applications and to investigate techniques which might be eiv J 
to minimize cost of such consoles. The salient features of the c 
console system are these: typewriter keyboard input; refreshed t 

play output; and a buffer/ control ler, consisting of a 128-track 
storage device and a Varian Data Machines 6201 computer, interpo 
tween the central computer and the display consoles. The buffer/ 
troller is capable of serving a maximum of ten display terminals 



addition to the keyboard, a set of programmed and programmable buttons 
is being provided in an effort to determine the usefulness of this 
approach as a user aid. 

In the original system design, catalog information and full 
text are displayed on separate CRT's. This configuration Is obviously 
Inconvenient and costly; a single display is a goal of our project. 

A critical Item with respect to utilization of a machine- 
stored library Is the amount of preparation needed to engage the sys- 
tem and to make full use of Its power. The bona fide user Is Interested 
only In satisfying his need for Information as completely and quickly as 
possible, and with a minimum amount of preliminary effort. Nevertheless 
certain procedural matters must be mastered even before intelligible re- 
sponses can be derived from the machine. These Include an understanding 
of the options the user has for making searches and typing procedures 
for executing these searches, as well as an ability to type and to in- 
voke proper procedures for correcting typing errors. Since many library 
users are occasional or Intermittent users, they may always be in the 
category of new users in the sense that their retentlvlty of basic oper- 
ational rules from system engagement to system engagement may be mini- 
mal. User aids thus become a crucial item. 

Thus far, Intrex has experimented with several types of user 
aids. A User Guide which describes in detail the various features of 
the system and how to use them is available both off-line in hard-copy 
form, and on-line as a computer printout. In addition, the Guide is 
available in summary form in a separate booklet entitled How to Get 
started. Wall charts describing system operation are also posted di- 
rectly before the user above his console. Still to be prepared and 
tested are simple instructions on cards. 

The results of our experiments to date indicate that the off- 
line Guide is the version referred to mcst frequently; apparently, the 
time required to print out the various sections of the on-line version 
is considered to be wasteful. Nevertheless dissatisfaction with the 
off-line Guide has been expressed. Dissatisfaction seems to result from 
the large quantity of material it contains, the time required to assimi- 
late the material, and the fact that the phraseology includes a certain 
amount of technical jargon which is not understood. A further observa- 
tion is that a summary version of the Guide is helpful, but it must be 
supplemented by the full-scale version, finally, response to wall 
charts as user aids has been disappointing thus far. Further experi- 
mentation with their content is needed in order to determine their value 
as a user aid, if indeed they have value. 

To date experiments have been conducted only with a typewriter 
console as an input-output device. A new dimension to user-aids is 
added when a graphical terminal becomes available. Since its writing 
rate is an order of magnitude faster than that of the typewriter, cur- 
rent disinterest in the on-line version of the instructional guide may 
disappear. Furthermore, since the Intrex graphical console is being de- 
signed with a set of programmable switches, these may offer a wholly new 



I 



approach to on-line Instruction when these switches are programmed as 
user aids. 
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PROPOSALS FOR ENHANCING THE HANDLING OF INFORMATION 



Though the "publish or perish" credo may not have been an 
overt driving force three and more decades ago, publication has always 
been a modus operandi for the person who discovers, or thinks he has 
discovered, new knowledge and wants to share It with his fellow man, 
be this drive for monetary gal n s self gratification, or more selfless 
purposes. 



Reasonably generous support of the scientific and academic 
communities over the past two decades, together with Increasing appli- 
cability of a new tool, the computer, to Information processing have 
perhaps more than addltively combined to create an Impressively large 
volume of current publication and exponentially increasing growth rates. 
The volume Itself Is having somewhat of a sobering effect and the com- 
puter gives evidence of some labor-saving assistance, but neither offers 
the promise of stemming the rise In the growth of quantities of infor- 
mation that are being generated today, and will continue to be gener- 
ated. 



It should be noted that the generation of information, per se, 
is scientifically desirable. It is through new knowledge--discovered, 
pondered, digested, interrelated with other knowledge, some new and 
some old--that science has contributed so remarkably to the medical and 
technological states of well being our society enjoys today. 

It is, in fact, because we consider information so valuable 
that we don't want to lose it that we— both the producers and users of 
information--have been trying to develop innovative ways of packaging or 
containing and presenting information, ways of compressing and digesting 
it to keep it within the human overload threshold, ways of selectively 
disseminating and communicating it, etc. 

One alternative to the information problem is, of course, to 
stop it at its source, that is, to halt or slow down its production. 

This obviously would impair goals for our society that anticipate ever- 
greater incorporation of science-based advances into the working and 
personal lives of our citizens. It must be realized, however, that loss 
of information once it has been produced is at least as wasteful, if not 
more so, than not having produced it at all. 

Imagination has been at work over the past twenty years in the 
handling of information. We have seen the rise of secondary publica- 
tions, principally the useful abstract periodical with its associated 
indexes, and the mushrooming of information centers that have been func- 
tioning as libraries for scientific materials and as services offering 



bibliographic, reference, and sometimes more extensive aids to the In- 
quirer. Publishers have stimulated the preparation of many monographs 
that have tended to keep reasonably current with new Information, and 
the critical ‘'evlew and the annual review have grown in popularity and 
recognized Importance for their tutorial and summarizing values. Less 
formal communication has also been on the upsurge. The shortcuts to 
Information transfer attendant In corridor and off-the-cuff discussion 
have heightened the significance of the meetings of many professional 
societies. Special -Interest groups, some formally structured and others 
more or less ad hoc, have also exploited the conference device (usually 
restricting the attendance) as well as exchanges for Informal publica- 
tions (newsletters, correspondence, pre-publication papers, etc.). 

It would be unjust to Ignore the contributions that all of the 
primary and secondary publishers and producer and user groups have made. 
These contributions make the present scene crowded but not chaotic. One 
can, albeit laboriously, locate journals, papers, data, persons respon- 
sible for particular Ideas. One can even not know who or what one Is 
looking for and find some measure of satisfaction in a response If one 
Is willing to wander through the existing people/publications complex 
long enough. This, It Is contended, Is a better- than-nothlng situation, 
and It often works reasonably efficiently. It does, however, rely too 
much on the inquirer's patience, his knowledge of the peopl e/publ 1 ca- 
tions complex, and the patience and good will of those whom he Inter- 
rogates as well as their own knowledge of the complex. 

Is the Imposition of a high degree of order within information 
transmission channels and on their interrelationships the basis for a 
solution to all information handMng problems? It is believed that 
better order would, indeed, at least partially alleviate some difficul- 
ties that presently exist in locating information sources, understand- 
ing scopes and overlaps and areas of responsibility, etc. It is also 
believed that better order might tend to Identify overlaps and gaps in 
coverages and services and might lead to mutually reinforcing coopera- 
tive arrangements that do not now exist. The source of the order could 
be a seat of guidance that has both an overview and an objectivity and 
that could also perhaps function as an arbiter. 

What, then, must accompany better order? It is suggested that 
a more realistic appraisal is needed of not only the complex and the 
things (computers, journals, libraries, services) in the complex, but 
also of the people in the complex, all of the people. It is contended, 
for example, that people who want information are not all the same, that 
is, they do not all behave in the same way. It is obvious, of course, 
that the financial manager and the technical director and the bench 
scientist are likely to want different items of information that could 
all pertain to the same entity In a more generic sense. It is less ob- 
vious but can be demonstrated that two bench scientists or two financial 
managers might also want items of information packaged in different ways 
in accordance with their own idiosyncratic preferences for processing 
information. Can one say, idiosyncratic preferences, humbug? Perhaps. 
But which are humbug, and which might be importantly significant to the 
scope of the processor's enterprise, that is, which had better be paid 



attention to because they are Important to the man's productivity? Dif- 
ferent behaviors, different purposes, different Information needs, dif- 
ferent time priorities, different levels of experience, different geo- 
graphic locations--al 1 these and more bear on what Information may be 
appropriate for a particular person at a particular time In a particu- 
lar place. Automation may be able to accommodate some of these charac- 
teristics, people and machines may be needed for others. The mixes and 
matches have to be determined, and their determination Is part of to- 
day's exciting frontier toward more effective use of Information tomor- 
row. 
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EXPERIMENTS WITH RESPONSES OF USERS: PRELIMINARY FINDINGS 



A few background comments may serve to Introduce the remarks 
which will be made later in this paper. The Program of Research on the 
Management of Research and Development at Northwestern Is in its tenth 
year; it existed for four years prior to this at MIT, and had Its Incep- 
tion at Columbia U. The program is an Integral part of the organization 
theory area of the Department of Industrial Engineering and Management 
Sciences, and the training and experience of our faculty and advanced 
graduate students include a wide variety of disciplines within and out- 
side of engineering. Cur major approach is one of the development of 
fairly formal propositions followed by testing under field conditions 
in operating organizations. In the past five or six years, we have in- 
creasingly used field experiments In addition to study, and our long 
interest in the flow of ideas ^,id the communication process has provided 
a basis for several detailed studies of the information-related behavior 
of scientists and engineers. 

One of our earliest attempts to experiment with the informa- 
tion environment In an operating laboratory involved 18 X-ray crystal- 
lographers in nine Industrial and nonindustrial laboratories in the 
Chicago area. After a short period of observation, we provided them 
with seven services for another short period, then extended the services 
for a period of six months. Without going into the experiment in de- 
tail, and noting that the subjects were otherwise most cooperative, the 
point Is that they did not respond to their new services by making per- 
manent, observable changes in their searching behavior. 

In two related experiments, this time with medical research- 
ers, we provided the services of an experienced medical librarian. In 
the first, and smaller, experiment, we found some (at least temporary) 
changes in behavior. In the larger experiment, involving 12 groups of 
medical researchers, and starting out with 110 participants, we pro- 
vided both the 1 ibrar1an and facsimile equipment to six of the groups 
for 3 months, and then switched the service to the other six groups, ob- 
serving both groups during both periods. There were, as might be ex- 
pected, considerable variations in the frequency and characteristics of 
the usage, and we attempted to relate various personal, professional, 
job, and organizational characteristics to each individual's use or non- 
use of the system. 

Finally, two studies involved the actual use of documents. 

The first studied the usefulness of technical reports and literature 
arriving on the desk of a group of researchers; one of the things ob- 
served was that a researcher will call an item useful even if it does 
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not cause him to take some action. The second, and larger, study was an 
administrative experiment Involving 85 scientists and engineers at three 
government laboratories. Here, the purpose was to discover whether the 
presence of an abstract accompanying a document was related to shorter 
times and better decisions with respect to the initial disposition of 
the document, and the result was that there appeared to be no signifi- 
cant difference. 

We are involved in other investigations, one a longitudinal 
stucty to trace changes in style over a period of five years, but one of 
the points which our work suggests, and which we find in studying the 
research of other investigators, is that scientists and engineers do not 
respond to the mechanism--the systems and services that are provided--in 
neat, stable, easily categorized ways. 



Commentary 



In the face of the voluminous research literature on the in- 
formation process and the many programs to develop and test new infor- 
mation systems and services, it may seem difficjlt to suggest that more 
needs to be done. One of the things that we car do is to improve our 
own information process by some of the many devices we already have 
available, including more comprehensive reviews and citations and pre- 
vious work in our own reports, assuring that reference is made to where 
the complete report of our research and supporting data may be found, 
and cooperating with those who are attempting to organize and summarize 
the state of the art. 

That which seems self-evident is that the information process 
--sources, transfer mechanism, users--is pigheadedly pluralistic. If 
we are to make progress, we must recognize that the solutions we propose 
are, and should be, for the most part modest modifications of processes 
that are already in use in some form somewhere. One of these areas is 
personal files. User studies point out how ubiquitous they are. These 
vary from a modest stack in a corner to elaborate, machine- indexed sys- 
tems. There are, for example, a number of systems among those working 
on our Northwestern program-- alphabetic files indexed with 3x5 cards, 
serial files with computer-printed author-title indexes, title and ab- 
stract computerized files, etc. Further afield, individual and small 
group filing systems are endless, some competing with or even drawing on 
library systems. In all of these, we can find the expression of a real 
need for help. One obvious possibility is to find that minimum common 
denominator that would provide the basis for a degree of what in other 
areas is called standardization. Some minimum compatibility, some basis 
for a transform that might save those who are still trying to develop 
personal filing systems the effort, and allow those who have filing sys- 
tems to draw on one another, to communicate, to exchange, to build and, 
perhaps, to take advantage of the economies of common machine programs 
and information stores. In our program, we are working with a minimum 
identification system along these lines for our field data. 



This argument for a minimum compatibility stems from a recog- 
nition that even our largest and most puissant systems--DOC, Library of 
Congress, or the major journal s--have not only not brought conformity 
but have difficulty achieving recognition and acceptance. Perhaps it 
Is In order to spend more time exploring systems that do a minimum for 
the individual, that provide him with an opportunity to react to the 
system over the range from nominal awareness and the minimum identifi- 
cation of the document all the way to the richest and most variegated 
Interaction with all the various forms, including the whole document 
(and perhaps its gestalt -the author's vitae, the refereeing process of 
the publisher, etc.). 

It would seem that there are a number of relatively simple 
and mundane things which could be done to Improve the process. Most of 
these are already being done, but unevenly. For example, in the era of 
the photocopier, a full citation printed on the first page of each jour- 
nal not only saves time, but it also avoids the later look up and the 
too- frequent erroneous citation. Similarly, the title page or the over- 
leaf on reports could contain a standard citation, especially to provide 
a clue to handling Indicia like "Final Report ..." and the multi-line 
designation of the publishing organization. None of these present prob- 
lems to the professional librarians, but they do inhibit the individual. 
Similar comments apply to the title pages of books. 

From our stucty of the use of abstracts, mentioned above, we 
learned that the scientists and engineers did not find themselves over- 
burdened by having to handle documents that arrived at their desks. We 
also found that they drew on many sorts of cues, both within and out- 
side the immediate document, to determine whether the document was use- 
ful or not. They tended to read, or retain for reading, quite a few of 
the documents. It seems probable that their behavior might well result 
in physical storage problems, and we did not attempt to determine wheth- 
er they might have a retrieval problem. If, however, the presence of a 
document does not present an information overload, If the individual can 
deal with the sorting problem on terms at least acceptable to him, it 
may suggest that mechanisms, such as the interactive display, or the mi- 
crofiche, that would bring many documents quickly and easily to him, 
will be of central importance. And if we make a record of his interac- 
tions, and especially the patterns of his requests, we may be able to 
establish priorities in the development and implementation of the facil- 
ities, such as buffer libraries, various surrogates, and decision trees. 

In summary, these comments suggest that broad palliatives, or 
detailed formal systems may be inherently unproductive, at least until 
we learn far more about the process by which the individual determines 
what he wants and how he wants it. It is suggested that the variety of 
systems that people use may provide clues to those minimum systems that 
will be most adaptive, and most used. 
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THE ROLE OF INFORMAL COMMUNICATION 



A study In progress at BSCP deals with patterns of exchange of 
information through means other than journals. This area appears less 
readily amenable to technical solutions. Here we are concerned with 
direct and indirect interchange between individuals or a speaker and his 
hearers. This form of communication is probably the most frequently 
used; it deals mainly with smaller units of information (conversations 
and correspondence) and it is much more difficult to measure in quanti- 
tative terms. It is also a convenient refuge for respondents to surveys 
who prefer not to record a reliance on specific sources of information. 

Research on informal communication has been primarily descrip- 
tive. Some of the data available come from user studies made for other 
purposes. Little is known about the amount of information and how much 
of that amount is new, which actually passes from one scientist to an- 
other through informal means. Perhaps because of its amorphous nature 
and the facile expression invisible college the whole subject has tended 
to assume dimensions which may be misleading and unlikely to stand very 
close scrutiny. 

One illustration of an attempt to set in motion an Informal 
exchange and to observe its activity was the experiment sponsored by 
the National Institutes of Health. The experiment of an Information Ex- 
change Group, which lasted from 1962 to 1966, was extended to cover 
seven select groups of scientists. The IEG was indeed a novel concept, 
sufficiently costly to require a government sponsor with facilities and 
resources to support a substantial effort. Unlike studies which have 
sought to find generalizations from observing the real world, the NIH 
program was a true experiment in which the individuals who participated 
were formally engaged in the effort and the information exchanged was 
itself a part of the experiment. 

Briefly, the concept was that the mechanism of information ex- 
change would focus on a specific research topic and gather the relevant 
invisible college into group membership. In all, seven such groups were 
formed ranging in size from 250 to 1500 members. Group number one came 
closest to representing the original idea--its topic was Oxidative Phos- 
phorylation and Terminal Electron Transport . Even this topic brought 
750 members, either nominated by a member or admitted at their own re- 
quest. But by the time a seventh group was formed, the subject scope 
had broadened almost to a generali ty-- Nuclei c Acids and the Genetic 
Code . Altogether, 3600 scientists participated and of the original 
seven groups, four are still in existence but operate outside the frame- 
work of NIH sponsorship. 



Clearly, there are some conclusions that can be derived from 
this experience, yet even these are exceedingly weak on questions re- 
lating to why. scientists' Information gathering habits follow any par- 
ticular pattern. The experiment, however, was not conducted to yield 
answers to this question. Instead, it was aimed at trying out a vehi- 
cle for conveying substantive information and a mechanism for handling 
the information. In both respects it accomplished Its purpose, but our 
Interest now is In using the record of what took place as a means of 
getting Insights Into other aspects of information exchange. 

One obstacle to the collection of definitive data on the 
whole subject of communication is the unreliability of any form of 
questionnaire or Interview. Even the diary technique used by the Amer- 
ican Psychological Association is not free from distortion. The pre- 
paratory studies for the formation of the Lister Hill Biomedical Com- 
munity Network point out the dangers of poll -taking In a community 
characterized by a diversity of interest and sophistication. The Sys- 
tem Development Corporation in approaching the task of evaluating In- 
formation systems started with recognition of the fact that judgments 
of relevance were valid only when correlated with the users' purpose in 
seeking information. 

This same problepi, which assumes a somewhat different form in 
studies of information media (journals, preprints, papers, correspond- 
ence), shows up clearly in responses to different Inquiries. It would 
be necessary to assemble the responses to a considerable ser 2 s of 
studies to demonstrate the inconsistencies which seem to characterize 
their results. While such a hypothesis may prove false, it is possible 
that the apparent purpose of a survey has affected the responses to a 
degree that makes much of the data questionable. The kind of anomalies 
that may appear suggest that respondents may tend to record their pref- 
erences with two objectives: 1) to place themselves in the best possi- 

ble light with regard to keeping current to their special field, 2) to 
avoid a firm commitment to reliance on the information medium under 
study in the questionnaire or interview. 

These problems are perhaps more properly dealt with in studies 
of the sociological or psychological aspects of scientific communica- 
tion. But it is reasonable to assume that inconsistent findings are in 
need of explanation before poll-taking data can be considered adequate 
for decisions about improved mechanisms for information exchange. These 
inconsistencies show up in different kinds of studies. When a community 
of document system users (or potential users) is questioned, the results 
show that journals are a major source of information; when journal uti- 
lization is studied, a strong reliance on advance information from pre- 
prints and manuscripts is reported. When the information Exchange 
Groups (NIH) membership was questioned about the preprint exchange, 
many indicated a preference for receiving only titles or abstracts on 
the single topic which was the precise focus of group membership. 

When the results of surveys produce so little confirming evi- 
dence, it can scarcely be said that we know very much about the uses of 
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literature and Information In the scientific communities. One of the 
difficulties clearly lies In the tendency of aggregate numbers to merge 
different segments of a community Into a whole and to conceal their dif- 
fering interests. Studies addressed to a discipline are especially vul- 
nerable. Even mission-oriented groups are composed of smaller entitles 
distinguished by their functions. But an awareness of the many dif- 
ferent needs and uses cannot alter the fact that both publication and 
Information handling systems require large user communities in order to 
maintain their activities and are limited In their ability to address 
different services to segments of those communities. 

Since multiple services addressing the preferences of smaller 
and smaller interest groups Is clearly too costly to become the likely 
direction of future effort, it Is important to be certain that general 
purpose services are Indeed based on a clear perception of needs and 
patterns of recourse to Information. The same is true for the scienti- 
fic communities whose preferences and habits are being studied. Their 
own perceptions may need revision if any Innovations in the communica- 
tion process are to be developed. Consider briefly some of the findings 
from two studies. The first was made in 1969 when the Entomological 
Society of America was canvassed to collect data about the present use 
patterns and the media employed maintaining current awareness in the 
field. 



In response to a question about utilization of sources, 743 
respondents out of a total of 2334 reported extensive use of abstract- 
ing journals. Yet 1414 reported that they regularly utilized Biological 
Abstracts. The most often cited source extensively used to obtain in- 
formation was professional journals (1740 respondents) representing 74.8 
percent of the total. This suggests a significant reliance on journals, 
particularly when compared with the 31 percent who reported extensive 
use of abstracting journals. More than three fourths of all respondents 
expressed satisfaction with the sources available to them--both primary 
and secondary--and did not feel a lack of access to be a problem. This 
would be a gratifying response if it did not contrast with the scope and 
coverage of primary literature reported under the heading of core mate- 
rial. Of the more than 850 journals devoted to Entomology in the 
world's current literature, only 147 were reported as sources by the 
respondents. Either the community are unaware of the major portion of 
the journals devoted to this subject or that same portion of the litera- 
ture (75 percent) is so lacking in new or significant research material 
as to be unnecessary for publication. 

When we turn to the invisible college as the medium for ex- 
change, bypassing the journal and its time-lag problem, another study 
offers some curious findings. Admittedly an informal and unpublished 
study, still the outcome casts reasonable doubt on the role of this 
mechanism. Here, each scientist was asked to name the authorities in 
the field; the individuals names were then asked to make a similar list. 
It was hoped by this means to discover enough commonality to gain some 
insight into an invisible college. Instead, the results showed no over- 
lap at the first level and almost none at the second. 



Overall, these illustrations have been drawn to point to the 
anomalies both within and among efforts to study the exchange and use of 
information. A definitive comparison of many pieces of study data, like 
the user studies themselves, would probably lead only to predictable 
results. We can also venture some conclusions which can have an impact 
on the direction of efforts aimed at special solutions to special Infor- 
mation problems. 1) The results of extensive research Into scientists' 
literature and Information utilization practices are too imbalanced to 
serve as a basis for generalization useful In systems development. 2} 

If there is any real benefit in seeking more definitive information, 
then new methods and techniques are necessary. 3) The chief problem 
may lie in the momentum of publication, and if so, qualitative studies 
and some self restraint may achieve more than searching for technical 
solutions in the production of literature. 
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